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Preface 


The first, second, and third editions of this book seem to occur at ten year intervals. The intent 
is to keep the book up-to-date. Many-body theory is a field which continually evolves in time. 
Journals only publish new results, conferences only invite speakers to report new phenomena, 
and agencies only fund scientists to do new physics. Today’s physics is old hat by tomorrow. 
Students want to learn new material, and textbooks must be modified to keep up with the 
times. 

The early chapters in this book teach the techniques of many-body theory. They are 
largely unchanged in format. The later chapters apply the techniques to specific problems. 
The third edition increases the number of applications. New sections have been added, while 
old sections have been modified to include recent applications. 

The previous editions were set in type using pre-computer technology. No computer file 
existed of the prior editions. The publisher scanned the second edition and gave me a disk 
with the contents. This scan recorded the words accurately and scrambled the equations into 
unintelligible form. So I retyped the equations using LaTeX. Although tedious, it allowed me 
to correct the infinite numbers of typographical errors in the previous edition. The earlier 
typesetting methods did not permit such corrections. The entire book was edited sentence-by- 
sentence. Most old sections of the book were shortened by editing sentences and paragraphs. 

I also contemplated removing entirely some old sections. Each time I did this, and told 
somebody, they always remarked that the deleted section was their favorite, and I simply 
could not remove it. While it is gratifying to have so many sections be everyone’s favorite, it 
does make shortening the book somewhat hard! In the end I gave up, and no sections were 
removed. Many were rewritten to shorten them. Since many new sections were added, the 
book gets longer with each edition. The reference list was updated. 

New sections include: Bethe lattice, different mean-free-paths, Hubbard model, 
Coulomb blockade, Landauer transport, and the Quantum Hall effect. The big problem is 
what to say about high-temperature superconductivity. Although much experimental infor- 
mation is available regarding this important topic, the theoretical picture is quite uncertain. 
There is no agreed understanding of the pairing mechanism which causes the high transition 
temperature. It is hard to write a text book on a topic for which there is little agreement 
regarding fundamental theory. In the end, I mentioned only some important experiments and 
their results, and added little new information on the theory mechanisms. The section on the 
gap equation was rewritten to use the modern method of solving it in complex frequency 
space, rather than the older method of real frequency space. 


vV 


vi Preface 


I thank Steve Girvin for his proofreading twice the various versions of the section on the 
Quantum Hall effect, and Koung-An Chao for teaching me about quantum dots. I also very 
much thank my wife Sally for letting me spend every evening and weekend for one year 
preparing this new edition. 
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Chapter 1 
Introductory Material 


1.1. HARMONIC OSCILLATORS AND PHONONS 


First quantization in physics refers to the property of particles that certain operators do not 
commute: 


[x, Py] = ih (1.1) 


ð 
E > ih— 1.2 
> he (1.2) 


Later it was realized that forces between particles were caused by other particles: photons 
caused electromagnetic forces, pions caused some nuclear forces, etc. These particles are also 
quantized, which leads to second quantization. The basic idea is that forces are caused by the 
exchange of particles, and the number of particles is quantized: one, two, three, etc. The 
quantization imparts a quantum nature to the classical force fields. 

In solids the vibrational modes of the atoms are quantized because of first quantization 
(1.1.1). These quantized vibrational modes are called phonons. An electron can interact with a 
phonon, and this phonon can travel to another electron, interact, and thereby cause an indirect 
interaction between electrons. Indeed, the phonon need not move but can vibrate until the next 
electron comes by. The induced interaction between electrons is an example of second 
quantization. The phonons play a role in solids similar to the classical fields of particle 
physics. They cause quantized interactions between electrons. 

Phonons in solids can usually be described as harmonic oscillators. A fuller description 
of the effects of anharmonicity is introduced later. But, for the moment, this idea should be 
sufficient motivation to study the harmonic oscillator. The one-dimensional harmonic oscil- 
lator has the Hamiltonian 


2 
H=% + 


K 
2m 2 


x (1.3) 


anh 
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To solve this Hamiltonian, introduce a dimensionless coordinate €: 


o = E (1.4) 
m 
= (1.5) 
_9__P 
iE = (1.6) 
and 

hal Ë y 

n= (5e) (1.7) 


The harmonic oscillator Hamiltonian has a solution in terms of Hermite polynomials. The 
states are quantized such that 


AW, = ho(n + 5) (1.8) 


where n is an integer. Use Dirac notation for the eigenstates |n) = W,. One can also learn by 
direct calculation that the following matrix elements exist for the operators x and p: 


h 1/2 
(n'|x|n) = (5) M Span +O Swan] 


2mo 
1.9 
mho) 1/2 1/2 2 
pin = (SP) O Bun ua 
It is customary to define two dimensionless operators as follows: 
1 ð may !/2 ip 
eoat) G) Co) 

(1.10) 


1 ð m@) !/2 i 
po fe 2% )\_ (=) P 
a al’ x) 2h ( 3 
They are Hermitian conjugates of each other. They are sometimes called raising and lowering 


operators, but here they are called creation (at) and destruction operators (a). The Hamil- 
tonian (1.7) may be written with them as 


H= nO aa’ + ata] (1.11) 
ho | 1 0 o 1 ð 0 
-Fpa k-z) te-a) (E+) a% 
hof & 
n=7 (i+) 8 
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A very important property of these operators is called commutation relations. These are 
derived by considering how they act, sequentially, on any function f (&). The two operations a 
and a’ in turn give 


i 0 0 i y 
aa'f® =E- EOE) (1.14) 
while the reverse order gives 
ol 0 0 olez, p m 
daO = ie- iNO =E- (1.15) 
These two results are subtracted, 
laa’ -d'd fO =G (1.16) 


and yield the original function. The operator in brackets is replaced by a bracket with a 
comma, 


[aad — ata] = [a, at] (1.17) 


which means the same thing. The relationship (1.16) is usually expressed by omitting the 
function f (6): 


[a, a] = 1 (1.18) 

In a similar way, one can prove that 
[a,a] = 0 (1.19) 
[at, a] = 0 (1.20) 


These three commutators, plus the Hamiltonian 
h h 
H= = [aat + ata] = S aat — ata + 2ata] = holatat 5] (1.21) 
completely specify the harmonic oscillator problem in terms of operators. With these four 


relationships, one can show that the eigenvalue spectrum is indeed (1.8), where n is an integer. 
The eigenstates are 


(a‘y” 
n 0 1.22 
|n) Tal |0) (1.22) 

where |0) is the state of no phonons which obeys 
al0) =0 (1.23) 


and where the n! is for normalization. Operating on this state by a creation operator gives 


; _ (att! _ (n 4 1)! bel 
a |n) Saal |0) Jaan” ) = 10) (1.24) 


=(n+1)'"|n +1) (1.25) 
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the state with the next highest integer. The only matrix element between states are 
(n'at ln) = (n + 1) Snan (1.26) 
(n'|a|n) = (m) 8 yan (1.27) 


The second expression is derived from the first by taking the Hermitian conjugate of the first, 
and then exchanging dummy variables n and n’. Alternately, 


aln) = (n)! in — 1) (1.28) 
So the destruction operator a lowers the quantum number. Then operating by the sequence 
atajn) = a(n) In — 1) = (n)'7 al |n — 1) = njn) (1.29) 


gives an eigenvalue n, which verifies the eigenvalue (1.8). Furthermore, using the original 
definitions (1.10) permits us to express x and p in terms of these operators, 


1/2 

x= (5) (a+a') (1.30) 
1/2 

p=i(™") -a (1.31) 


and the matrix elements (1.9) follow immediately: 


h 1/2 
(n!|x|n) = (5) [(n'laln) + (n'la In)] 


} 1/2 
(5) (ORLE + (n + 1)" Snan] 


[mho 1/2 1/2 1/2 
=] > [(n + 1) On'=n41 = (n) On'=n—11] 


The description of the harmonic oscillator in terms of operators is equivalent to the 
conventional method of using wave functions W,,(€) of position. 

The time dependence of these operators is often important. In the Heisenberg repre- 
sentation of quantum mechanics, the time development of operators is given by (A = 1) 


O(t) =e" Oe (1.32) 
so that the operator obeys the equation 

dO(t 

(220 = ilH, O(t)] (1.33) 


For the destruction operator, this equation becomes 


a 
z4 = ilH, a] = iofa'aa — aa'a] = ioja’, ala = —iwa (1.34) 
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which has the simple solution 

a(t) =e'a (1.35) 
The reference point of time may be selected arbitrarily, so that the operators have an arbitrary 
phase factor associated with them. This phase is unimportant, since it cancels out of all final 
results. The Hermitian conjugate of this expression is 


al (t) = eal (1.36) 


The time development of the position operator can be represented as 


1/2 
x(t) = (5) (ae + ale!) (1.37) 
~— \2mo l 


This result for x(t) will be used often in discussing phonon problems. 
Another familiar problem which can be solved with operators is a charged harmonic 
oscillator in a constant electric field F: 


2 
K 
H = +57 + eFx = o(ata +1) + Ma ta’) (1.38) 
% \ 12 
_ oF 1.39 
A= eF (sr) (1.39) 


This Hamiltonian may be solved exactly. First consider the equation of motion for the time 
development of the destruction operator: 


0 
= — ilH, a] = —i(@a +) (1.40) 
The right-hand side is no longer just proportional to a, since there is the constant term. 
However, let us define a new set of operators by the relationships 


À 
A= — 1.41 

a+— (1.41) 

O À 
A =a ++ (1.42) 

O 

They obey the equation 
ðA 


so they have the simple time development 
A(t) =e "A (1.44) 


AA) = d% (1.45) 
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Indeed, one can show that they have the following properties: 


[4, At] = jaat] =1 (1.46) 
[A, A] = 0 (1.47) 
[At At] = (1.48) 
#=0|(4'-*) (4-2) +5] (4 +4 -22) (1.49) 
O) O 2 A) 
o naL NYX 
= o(4 A+ 5) 5 (1.50) 


It was remarked above that any set of operators with these properties had a solution in terms 
of harmonic oscillator states: 
I\ A 
Hin) = | of n+- | —-— |in) (1.51) 
2 O; 


|0) (1.52) 


The operator for the position is 


1/2 
x(t) = (5) (a + Atel — =) (1.53) 


The physics of the Hamiltonian (1.38) is very simple. The spring stretches to a new equili- 
brium point which is displaced a distance 


h N\A eF 
__ ch Z 1.54 
*0 (5) wo K (1.54) 


from the original one. It oscillates about this new equilibrium with the same frequency @ as 

before. These oscillations are still quantized, in units of œ. The energy —A”/@ = —e?F? /2K 

is that gained by the spring from the displacement along the electric field. One can get the 
same result directly in coordinate space. The Hamiltonian is written as 

2 2 2 

H=2 +1 (x =) _ EF (1.55) 

2m 2 


and a new coordinate x’ = x + eF/K is defined which obeys 
[x', p] = ih (1.56) 
The variable x’ describes the simple harmonic motion. This result completes the discussion of 
the harmonic oscillator in an electric field. 
In a solid there are many atoms, which mutually interact. The vibrational modes are 


collective motions involving many atoms. A simple introduction to this problem is obtained 
by studying the normal modes of a one-dimensional harmonic chain: 


2 
=P +É was)? (1.57) 
i 2m 2 i i+ 
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The classical solution is obtained by solving the equation of motion: 


A solution is assumed of the form x; = x9 cos(kaj) and the force term becomes 


2x; — Xj41 — Xj—1 = Xql2 cos(kaj) — cos(kaj + ka) — cos(kaj — ka)| 


= 2x, cos(kaj)[1 — cos(ka)] 


The normal modes have the solution 


oO; = 2K [1 — cos(ka)] = 4K in? (5) (1.59) 
m m 2 


The quantum mechanical solution begins by defining some normal coordinates, assuming 
periodic boundary conditions: 


1 ikal 1 —ikal 
X; = ——=— e" X13 x, =—),e"%x 
E y= ey 
I —ikal l ikal 
= —— e l ; = — e' 1.60 
P= Xe Ph Pk 7 2e" Pi (1.60) 


This choice maintains the desired commutation relations in either real space or wave vector 
space: 


[xr Pm] = IÒ im (1.61) 
l —i ik'am 

[xk Py] = Noe kal o'k [xz Pm] (1.62) 

= SD eE- = 10K. y (1.63) 


From the general result 
l . . . 
y XX pm =< > XpXp 5 eiallkt+k ) gimak’ — y XX4 giamk 
N kk ] k 
Pi = L PP- 

It is easy to show that the potential energy term in (1.57) is 

K K j BF m 

> 2 (x; - Xy) =5 2 xx (2 — e" — e- ™) = > D ORX k 

j k k 

The Hamiltonian may be written in wave vector space as 


1 
H =~ > [pep_g +m wpx,x_z] (1.64) 
k 
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The Hamiltonian has the form of a simple harmonic oscillator for each wave vector. Define 
the creation and destruction operators as 


mop 1/2 i 
a= (oF) (a Tma +) ue) 


p_ [MoA i 
=)" ee) 0 


They obey the commutation relations 


[a;., aj] = -zz (le Pel — [Pr Xe ]} = On (1.67) 
[a,, ay] = 0 (1.68) 
[a}, at] = 0 (1.69) 


and the Hamiltonian may be written as 


H = 3 olala, + 3] (1.70) 


These collective modes of vibration are called phonons. They are the quantized version of the 
classical vibrational modes in the solid. These are the same commutator relations, and 
Hamiltonian, as in the simple harmonic oscillator. Each wave vector state behaves indepen- 
dently, as a harmonic oscillator, with a possible set of quantum numbers n, = 0, 1, 2, ... The 
state of the system at any time is 


(a;,)" 
W = |n, n,- np) = Mln) = Oy -= |0) (1.71) 
ke 
so that the expectation value of the Hamiltonian is 
k 


In thermal equilibrium the states have an average value of n} which is given in terms of the 
temperature B = 1/k,T: 


(ny) = N; = = nz(@,) (1.73) 


1 
eo% — 1 


The system fluctuates around this average value. 
The position operator in wave vector space, and real space, is 


1/2 
x(t) = G=) (ape ™ + a! ,e'*") 


h NË a | 
ft) = l —10,¢ t 10), 
x(t) > ( soa) e1 (ae +a_,e™) (1.74) 
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Often mN is replaced by the equivalent quantity mN = pv, where p is the mass density and v 
is the volume. At some point there is a summation over the discrete set of eigenstates for the 
system of finite volume v. It is convenient to change the summation to an integration: 


Jim <6) = [Z 


1.75 
ny (1.75) 


During this change, any delta functions must change from discrete delta functions (called 
Kronecker deltas) to continuous ones. Since 


L f(k’) = ED SWS, v= a af WB (1.76) 
_ Qn) 
; Z 3 ) (1.77) 
it is concluded that 
lim ôk -20 8(k — k’) (1.78) 


In general, our preference is to write wave vector summations as discrete summations until it 
is time to do the integrals and only then make the changes (1.75). 

The quantum mechanical solution has the same frequencies as found in the classical 
solution. Quantum mechanics only enters in a quantization of the amplitude of the oscillation. 
The phonons occur in discrete numbers with zero, one, two, etc., phonons in each state k. 
When the average number of phonons is large, n, >> 1, the quantization is irrelevant, since 
the system behaves classically. The quantum nature of the field is more important when the 
average number of phonons in each state k is small. 

In three-dimensional solids, the theory is nearly identical except there are more indices. 
Suppose there is a potential function between atoms or ions of the form 


>» VR, -R (1.79) 


where R; is the position of an atom. If it is vibrating, then denote RO as the equilibrium 
position and Q; as the displacement from equilibrium: 


R; = R +Q; (1.80) 
The potential function is expanded in a Taylor series about the equilibrium position: 


V(R; — R,) = VRP — Ri”) + (Q; - Q) ve - R”) 


+5 - QQ- Q r aR, = VRP - R”) 


+ O(Q°) (1.81) 


The term linear in displacement vanishes, because one defines the equilibrium position R as 
the place where the sum of the forces on an ion j is zero: 


F, = E VVRP - R”) =0 (1.82) 
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The important term is the one which is quadratic in the displacements. It gives the potential 
energy of the phonons: 


l 
Va = (Qi -= QQ- Q) 


0) pO 
3? 
® (RO —R®) = —— y(R® - RO 1.84 
uv l J ) AR, AR, ( l J ) ( ) 


The interaction is evaluated in wave vector space by trying an expansion of the form 


1/2 

, — iogh ik RO 

QO = D(a -) Ék Alap re Om + aly erm eR (1.85) 
k, k 


where M is the ion mass. The factor of i on the right-hand side of the equation is required to 
make Q = Q; since it represents a real displacement in space. Take the Hermitian conjugate 
of (1.85) and change k —> —k. Nearly the same result is obtained as found in Q;. To make it 
Hermitian requires that 


Ein = —ő k, (1.86) 
The polarization vectors €,, are assumed to be real but change sign with k direction, 
E_k = —&, and the above identity is satisfied. Since the displacement is in three dimensions, 


there are 3L normal modes for each value of wave vector. Here L is the number of atoms per 
unit cell of the crystal. The index A runs over these 3L values of normal mode. Each mode 
will have its own eigenfrequency œp}. It will also have a polarization vector ¢,, which 
specifies the vibrational direction of the ion for each wave vector and mode À. If there are 
more than one atom per unit cell, one should add further subscripts to M and €, , to specify 
the values for each atom per unit cell. 

The right-hand side of (1.81) may be written as 


M 
= DVR? -RP) +7 E Qe Qi (1.87) 
ij , 


The first term is a constant which will be neglected in our discussion of vibrational modes. 
The eigenvalues @,, are those solved in the harmonic approximation. In this approximation, 
one retains the quadratic term only in the displacements in the Hamiltonian. To be more 
careful, a third-order anharmonic term V can be added for the case of one atom per unit cell: 


H =H, +Vj (1.88) 
1 

Ay = uM 2 (Pra, Ply tM of, Qh, Qa) (1.89) 

= 3 opa (alaa +4) (1.90) 


PV 


a (1.91) 
© AR, ORAR, 


1 
n=; ZQ: — Q)),(Q; — Q),(Q; — Q) 
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and one solves Hy for the modes. These modes are harmonic oscillator states for each wave 
vector k and mode à. The harmonic approximation applies to any theory of phonons which 
retains only the terms which are quadratic in the displacements Q,. Actual solids are described 
by potential functions which are more complicated than the central force field V(R; — R,) 
which we have assumed. For example, in semiconductors there are usually bond bending 
forces between nearest neighbors. Nevertheless, the Hamiltonian is still written as (1.90) in 
the harmonic approximation. A complete description of these calculations is given by Born 
and Huang (1954) or Maradudin et al. (1963). 

The terms in the Taylor series higher than quadratic are treated as perturbations. These 
are called anharmonic effects and are very important in solids with light atomic masses: 
hydrogen, helium, lithium, etc. They are often important in some other solids. The first term is 
usually cubic in the displacements and has the form shown above. In terms of wave vectors it 
becomes 


V3= >) DY Qe Qu, Q-k-g.n, M (K, Dara, (1.92) 
k,q A, A5A3 


The matrix element in the cubic term is quite complicated, and it will not be written out. It is 
difficult to determine from first principles anyway. If the first two displacements Q have wave 
vectors k and q, the third has —k — q to ensure wave vector conservation. This interaction 
may be written in terms of creation and destruction operators by using (1.85). In this 
representation, it is apparent that these cubic terms permit one phonon to decay into two and 
vice versa. 

For solids in which the anharmonic terms are important, one must try to include the 
effects of the cubic perturbation V, and perhaps also higher terms such as quadratic. Including 
these terms is a many-body problem. The effects are quite temperature dependent, so it is 
necessary to use Green’s functions at nonzero temperatures. 

A word about notation. We dislike subscripts and superscripts. In discussing phonons, 
the subscript à will usually be omitted, although it should be carried in every expression. The 
summation over phonon modes is really meant to imply summation over wave vectors and 
modes À. 


1.2. SECOND QUANTIZATION FOR PARTICLES 


There are two ways to introduce the subject of creation and destruction operators for 
particles. The first is to describe their properties and then to omit any proofs. One could just 
remark that they work, which is why we use them. The second way is to go through elaborate 
justification arguments. These tend to leave the reader more confused than convinced. Here an 
intermediate approach is tried. A short justification will be attempted. Our discussion follows 
Schiff (1968). 

The first treatment is for boson particles which cannot be destroyed. It is hard to think of 
a fundamental particle with this property. The method is usually applied to composite 
particles such as “He which contain even numbers of fermions, so that it has boson-like 
properties. In any case, it is assumed the discussion is for a point particle. If it is in a potential 
U(r), the one-particle Schrödinger equation is 


202 


ii) = HW(r) = -4 V 


zz + vo] Wr) (1.93) 
m 
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This equation may be derived from the Lagrangian density 


L=inviw— Foy" Vw — U(r, Oty (1.94) 


The wave function is complex, with real and imaginary parts. These two parts can be treated 
as independent variables in the Lagrangian. An alternate procedure is to treat y(r) and yil) 
as independent variables. Then the usual variations give 


OL 
—=-Uy' 1.95 
ay 7 UY (1.95) 
aL h? 
=——y! 1.96 
ay 2m Wy ( ) 
OL 
— = iyt (1.97) 
oy 
When these relations are put into Lagrange’s equation, 
ðL 0 L ð aL 
o=% y2 \|-2% (1.98) 
ay T ax \ a(dW/ax,) ot ay 
t B ynt gy 
0 = -—UW' +— Vy — ih >y (1.99) 
2m ot 


the Hermitian conjugate of Schrédinger’s equation is recovered. If the same manipulations are 
tried with y? as the variable, then Schrédinger’s equation itself is derived. In the Lagrangian 
formulation, the momentum which is conjugate to the variable is 


T = xi = iyt (r) (1.100) 


The Hamiltonian density is given by 
, h2 
H=my—-L=—Vvy' -Vy + uyly (1.101) 


where one integrates over all volume to obtain the Hamiltonian 
h2 
H = [ern = far(-Tv +0) (1.102) 


where the kinetic energy term was integrated by parts. Since m and W are conjugate variables, 
they obey commutation relations of the form 

[Wcr, t), n(x’, D] = ihd(r — r’) (1.103) 
or using (1.100), 

Wor, D, Wr’, D] = èl — r’) (1.104) 


A commutation relation of this type is the fundamental basis of second quantization. 
Although it has been made plausible by the derivation from a Lagrangian, it really is a basic 
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premise. These commutation relations may be satisfied by introducing creation and 
destruction operators. Let H have eigenstates and eigenvalues of the form 


Hy, = & 94 (1.105) 
h2 
H = —— V? + U(r) (1.106) 
2m 
The wave function W(r) and its conjugate yi (r) are expanded in terms of this basis set: 
Wr) = 2 a(t), (1) (1.107) 
V'@) = La, Oi (1.108) 


The original field commutators (1.104) are satisfied if a and at operators have their own 
commutation relations: 


[a,(t), atO] = Syy 


[a,,(0), a/(0)] = 0 


la} (6), a1,(f)] = 0 (1.109) 
The commutation relations for the field variables are: 
[W(r, 2), Va", A] = 0 (1.110) 
Wor, A, yie, D= 0 (1.111) 
Wr, D, Wr’, D = Lla), a (ADP) (1.112) 
= 2 ,(r)>, 0") = 6 — r’) (1.113) 


One might also ask about the commutation relations at different times. How does one evaluate 


[a,(t), a1, (t’)] = 2 (1.114) 


The answer is that evaluating this expression is a many-body problem. In fact, that 1s one of 

the goals of Green’s function theory. The commutator at different times is related to the 

retarded Green’s function, which is defined and discussed in Sec. 3.3. The commutator at 

different times is one property of the time development of the many-body system. Simple 

commutation relations such as (1.109) are valid only if the operators are at the same time. 
The Hamiltonian is 


H= faery! WHY) = Y alay [Erion = e,alay, (1.115) 
AA A 


In Sec. 1.1 it was noted that any system with these commutation relations, and a Hamiltonian 
(1.115), behaved as harmonic oscillators for each state à. The eigenstates for each value of À 
have a discrete set of occupation numbers n, = 0, 1,2,3,.... All bosons have harmonic 
oscillator eigenstates. For phonons, the number n, is interpreted as the number of phonons in 
state À. For particles, the interpretation is the same. The number n, tells how many particles in 
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the system are in the same state A. However, for particles, unlike phonons, the total number of 
particles is conserved. The many-particle wave function has the form 


(aly 
ny} 


Il, |0) (1.116) 


so that the Hamiltonian (1.115) has the eigenvalue of 


E= em (1.117) 
À 


In thermal equilibrium, the average number of particles in a state À is given by the usual 
boson occupation factor: 


1 
(ny) = Few 2) 7M = ne — p) (1.118) 


Now there is a chemical potential, 1, which can vary with temperature and concentration. It is 
absent in the phonon, and photon, cases because these excitations do not conserve particle 
number. One may make as many phonons or photons as one wishes. Another operator of 
interest is the density operator: 


P(r) = WOW) = Yaya pimp) (1.119) 
AA 
The integral of p(r) is just the number operator: 
N= [rw = Yaa, (1.120) 
À 


Its thermal average is obtained simply by taking the thermal average of n,: 


(N) = } (m) = LM, (1.121) 


À 


This equation serves as a definition of the chemical potential and determines its variations 
with temperature and particle number N. 

The Hamiltonian (1.115) and number operator (1.120) are bilinear in creation and 
destruction operators. They contain only two operators, one of each kind. Hamiltonians of 
these kinds may always be solved, at least in principle. The problem may always be reduced to 
the diagonalization of a matrix. For example, consider the solution of our Hamiltonian: 


h2 


H = —— 
2m 


V? + U(r) (1.122) 
Suppose that we were unable to solve it exactly—which is an improbable assumption, since 
Schrödinger’s equation for one-particle potentials may be solved in milliseconds on the 
computer. Anyway, suppose there were another complete set of states , which are the 
solution to some other Hamiltonian. Expand the wave function in terms of these states, 


Wr) = > bnp, (0) (1.123) 
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where the creation and destruction operators 5, have the usual commutation relations: 


[Dns Oh] = nm (1.124) 
[b,,, b,,] = 0 (1.125) 
[bi bt] =0 (1.126) 
For the Hamiltonian and number operators: 
H = YỌ bib, Hom (1.127) 
N= bib, (1.128) 
Hn = | rd; ()H pnl) (1.129) 


This Hamiltonian may be solved in the following fashion. Examine the equation of motion for 
the destruction operator: 


3 | 
Sim by = [H,b,) = — >> Aum Om (1.130) 
It is assumed that an operator has the time development 
b(t) = b,(0)e™ (1.131) 
so that the solutions are of the form 


The eigenvalues Æ are the solution to 
det| Hm — E9nm| = 0 (1.133) 


It is only necessary to find the eigenvalues of the Hamiltonian matrix. Usually the matrix is of 
infinite dimensionality, since there are an infinite number of states in the set o,. But one may 
often diagonalize it exactly for many problems. Computers allow very accurate solutions for 
any case of interest. If all Hamiltonians had only bilinear operators, then many-body theory 
would only be an exercise in matrix diagonalization. Fortunately, it is more fun than that. 

Many-body theory is used to study Hamiltonians which have additional terms. These 
terms may be interactions with phonons, spin effects, or particle—particle interactions. The 
effects of particle—particle interactions may be understood by examining a many-particle 
Hamiltonian of the form 


l 
i 2 iH 
h? _. 
A, = — — , 
i zn yY tU) (1.135) 


The first term contains a summation of one-particle Hamiltonians H;,. This term by itself is 
just as simple to solve as H; alone. A collection of particles which do not mutually interact 
makes for a trivial problem. One solves the dynamics of one particle, and the total properties 
are the summation of the individual ones. The term which makes the Hamiltonian hard to 
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solve is the particle—particle interaction V(r; — r;). This term is multiplied by one-half since 
the double summation over (ij) counts each pair twice. This interaction is written in terms of 
creation and destruction operators: 


H = Hpnbibn +4 Vidnn OO byb 
nm klmn 


Vem = |r | Pr oie doce) V(r, — ro Eo) (1.136) 


The interaction term contains two creation and two destruction particles. This term is inter- 
preted as describing two-particle scattering events. One particle in state / scatters to state k, 
while another in state n scatters to state m. The process is illustrated in Fig. 1.1. Each index 
(k, 1, m, n) runs over all possible values. There are processes where, for example, k = 1, which 
describes a process where one particle scatters from n to m, while the other does not change 
its state. This term was carefully written so that both destruction operators are to the right and 
the creation operators to the left. The reason for this arrangement is to eliminate processes 
whereby a particle interacts with itself. For example, if this expression is written as 


bb bib, (1.137) 


then we could have m = /, and this term would describe how one particle interacts with itself. 
For example, if |0) is the particle vacuum where all n,, = 0, then b,|0) = 0. The state bt 10) 
contains one particle in state «. The interaction operator 


2 2 Veimnd bh bnbib}10) = 0 (1.138) 


on this state gives zero, since two particles cannot interact if there is only one particle in the 
system. However, the incorrect version of the interaction gives 


pe V itn}, bib], b„b} 10) = 2D V omma b110) (1.139) 


is nonzero, and this term (« = n, l = m) must just be the particle interacting with itself. These 
terms are avoided by writing the pairwise interaction in the form (1.136). 

For a gas or liquid of “He particles, a common basis set is just free-particle wave 
functions, 


Wr) = — Y e" Tag (1.140) 
k 


FIGURE 1.1 
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in which case the Hamiltonian has the form 


l 
H=)> ealag +— V (qai qth gtk’ A 


k2 
Ek = Im 
V (q) = | drenv (1.141) 


The operator p, is the particle density operator in the plane-wave representation: 
Pa = E aktga (1.142) 
The simplest way to write the interaction term in (1.141) is in the form 


PLONE (1.143) 


which is defective because both destruction operators are not to the right of the creation 
operators. There is a term where the particle interacts with itself. 

A possible difficulty with (1.141) is that V (q) may not exist. The potential V(r) may not 
possess a Fourier transform if the particle—particle potential is too divergent at small values of 
r. This divergence happens, for example, with the Lennard-Jones potential, which is often 
used to represent the heltum—helium potential. This difficulty may be avoided by summing 
subsets of diagrams to get a T-matrix interaction, which is always well behaved. 

So far the discussion has concerned boson operators and boson Hamiltonians. Now 
consider fermions. These are usually electrons, although occasionally one studies holes, 
positrons, or °He particles. Fermions have the property that any state may contain only zero or 
one particle, which is the famous exclusion principle. Jordan and Wigner (1928) discovered 
that Fermi statistics could be accomplished by making the fields anticommute, which is 
represented by curly brackets: 


VOW (r) + VOW) = (HO), Wr} = Sl- r’) (1.144) 
(Wr), ye) = 0 (1.145) 
(i (r), Wr} = 0 (1.146) 


If these wave functions are expanded in a basis set ©, (fr), 
V(r) = 2 Cr Ox (r) (1.147) 


VO) = Ladi (1.148) 
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the coefficients cl and c, become creation and destruction operators which obey anti- 
commutation relations: 


{c}, ct} = O44! (1.149) 
{c,, Cy} = 0 (1.150) 
(cl, cl} =0 (1.151) 


For example, consider {c}, c} = 2c,c, = 0. The operator c,c, acting upon anything gives 
zero, since c} is a destruction operator which destroys a particle from a state À. A state may 
only contain zero or one particle, which are called |0), and |1),, respectively: 


ell), =10), fll), = 
10), =0 — f|0), = l, (1.152) 


So cc, acting upon either |1), or |0), gives zero. Similarly, the combination chet = 0. It is 
zero because two particles cannot be created in the same state. Another way to see this result 
is to consider the number operator for a state, 


N, = che, (1.153) 
and its square, 
N? = clocl ey (1.154) 
Using the anticommutation relations cc? = 1 — cte gives 
N? = t(l — ce) C= che, — ch clone, = N, (1.155) 


The only numbers which are equal to its square are 0 and 1. The number N, may only be 0 or 
1. The anticommutation relations have built into them the fermion property that no more than 
one particle may be in the same state À. 

Quite often the Hamiltonians for fermion calculations are of the form 


nay |es U(r) +5 >> yer) (1.156) 
i | 4m jHi 


with the particles interacting with a potential U(r) and with each other through particle— 
particle interactions V(r; —r,;). The Hamiltonian is written in terms of creation and 
destruction operators exactly as in the boson case: 


H = Y H pmC}Cm +4 > V rimni ch CnC] 
nm 


klmn 
2072 
Hay = | Brd,(t)"| - 5+ UO) on 
Vam = | Pr, | Pr df OEM, — oE (1.157) 


The difference in behavior between fermions and bosons is often due to the difference in 
the commutation relations of the operators. The starting Hamiltonians are of similar form, 
e.g., for liquid *He and “He. 
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The particle—particle interaction term is still interpreted in terms of Fig. 1.1. Great care 
must be used in writing this term, or else one makes a sign mistake. For example, the term 
l t 
5 > V gtoan© Connon (1.158) 
klmn 


is wrong. The order of the two destruction operators was exchanged. But because of the 
anticommutation relation 


CCa = — CC] (1.159) 


this interchange causes a sign change. Equation (1.158) would be correct if the entire term 
were multiplied by —1. 

For the study of electrons in solids, a popular basis set is plane waves. Eigenstates are 
described by (p, o), where o is the spin index, which is +1 for spin up or down. The 
Hamiltonian then has the form 

H = 2. EpChoCpo + 2, U(q)Pq + L » Valk qoCK qok oCko (1.160) 
po q qkk'oo’ 
The second term represents the interaction between the electrons and the atoms or ions of the 
solid. The interaction U(q) is often represented by a pseudopotential (Harrison, 1966; Heine, 
1970). 
The electron density operator 


p(q) = Chi goCko (1.161) 
oO 


is the same as for bosons except for the additional summation over the spin index. The last 
term in (1.160) contains the electron—electron interaction, which is just a Coulomb potential. 
The Fourier transform of a Coulomb potential e*/r occurs often and has been given a special 
symbol with lowercase v,. The Fourier transform is 


dr, °° l 
Vz = e [Fer = 2ne? l rar | d(cos @) e'9”°8° 
r 0 -1 


2 2 foo , , 4 2 fo 
= | dr(e'” —e'V)= | dr sin(qr) 
ld Jo 
Ane’ 
= PA (1 — lim cos(qr)) (1.162) 


The integral is not well defined, since it oscillates at infinity. It is assumed these oscillations 
damp out, so that the result is 
2 
— ane” (1.163) 
q 
The electron—electron interactions are a significant part of many-body theory. Most calcu- 
lations involve worrying about electron—electron interactions in one form or another. 

The full electron gas Hamiltonian (1.160) is often too complicated to use for the more 
elaborate many-body theory. Quite often it is approximated by a model Hamiltonian which 
has a simpler form. Usually these model Hamiltonians look very simple but still are 
impossible to solve exactly. Often they are even difficult to solve approximately! Some of 
these popular models are discussed next. 


Ug 
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The homogeneous electron gas is a model which is studied frequently to learn about 
correlation effects. It has the Hamiltonian 


H= > Epp pol > Aal taofi qo lK'o'Cko (1.164) 


Ty q40,kk’ co’ 

The basic premise is to get rid of the atoms and to replace them with a uniform positive 
background charge of density nọ. The homogeneous electron gas is also called the jellium 
model. One can think of taking the positive charge of the ions and spreading it uniformly 
about the unit cell of the crystal. Of course, the homogeneous electron gas then has no crystal 
structure. To preserve charge neutrality, the average particle density of the electron gas must 
also be no. The average density of the electrons is just the q = 0 value of the density operator, 


l l l N 
—— = 0) =- t = — N. = — 1.165 
5 (0 = 9) = Elchi) = E (1.165) 
since the number operator may be summed to give the number of particles N,. In writing the 
Hamiltonian (1.164), the q = 0 term in the interaction term was omitted from the summation. 
The omitted term has the form 


NAN. 7 1) 


a dim 0, (1.166) 


This term was canceled by two other terms. One of these is the Coulomb interaction of the 
uniform positive background with itself: 


drr N? 
1,2 
xe Ng 3 zy UM YG (1.167) 
The other is the Coulomb interaction of the uniform positive background with the electrons: 


3.73 2 
d a -2 lim Vg (1.168) 
q> 


—e no | p(r) 


The N2 term cancels when these three terms are added, and the other term may be neglected. 
The Hamiltonian (1.164) describes a system which has charge neutrality. The interaction 
terms with q Æ 0 describe the fluctuations which occur because of electrons interacting with 
each other. 

The plane-wave model is often a poor approximation of electron behavior in ionic solids. 
In many solids the electrons are localized on atomic sites and only occasionally hop to 
neighboring sites. This behavior is described by the tight-binding model. One simple form of 
this model is bilinear in the operators: 


H= LM, Ch soô (1.169) 


The index j denotes a site at point R,, while j + 6 represents the nearest neighbor atoms. The 
number of nearest neighbors is called the coordination number and is represented by the 
symbol Z. One can think of the term 
¥ h? 
W; = [are (r — e-i — V? + vo Joo — R4) (1.170) 
as arising from the matrix elements between the orbitals ọ¢(r) which are localized on sites R; 
and R,,5;. The term W; for 6 Æ 0 represents processes where the electron jumps from site j to 
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j +6, while W, is the site energy. The potential U(r) is periodic. Simple versions of the 
model usually have only a single orbital state for each atomic site. More realistic versions of 
the tight binding model allow for the multiple orbitals characteristic of p- or d-electrons. The 
discussion will assume a single orbital state per atomic site. 

The bilinear form of the Hamiltonian is trivial to solve exactly. One defines the operator 
in wave vector space in the usual fashion, 


l ik-R, 
Cio =R" R eko (1.171) 
l ik R, 
Cko = wre Ri Cig (1.172) 
: J 


and the Hamiltonian may be rewritten as 


H = Wk)c -cko (1.173) 
ko 

Wk) = Y e’ W, (1.174) 
Ò 


If the cjo are fermion operators, then so are the c,,. They obey anticommutation relations 
{Cko> cl} = 6,-4/55,’. Each mode (ko) becomes an independent Fermi system, which may 
be treated separately in thermodynamic averages. The particle energy is W(k). A very 
common model is the nearest neighbor model. Here the hopping term is limited to just the 
nearest neighbor ions, which are presumed to be all alike. Usually the site energy is Wọ set 
equal to zero, which is just an arbitrary energy renormalization, and W; = w. Then the energy 
is W(k) = wZy,, where the factor 


Zy% = zen (1.175) 


is summed over the Z nearest neighbors. This solution is exact for the tight-binding model. 
For example, the partition function Z, for fermions is written as 


Z, =e PO = Tre PAY) = TT {1 + e0) (1.176) 


where Tr denotes the trace. The many-particle partition function (1.176) is obtained by 
treating each state k as independent and averaging its thermodynamic properties separately. It 
appears as if each electron in each state k is behaving independently. This picture is deceptive. 
The electrons are not just whizzing around independently. 

For example, if one were to calculate the probability that any two electrons of the same 
spin are on the same atomic site, a zero answer is obtained. Two electrons of the same spin are 
never on the same site in the single orbital models since they would be in the same state. Of 
course, it is a basic feature of fermion many-particle wave functions that two electrons of the 
same spin can never be in the same location. The motion of the electrons is correlated, which 
arises from the antisymmetry of the wave function under exchange of particle position. The 
motion of the electrons is not really independent. 

This correlation is built into the eigenstates but does not affect the energy in the simple 
bilinear model, which ignores interactions between particles. The partition function (1.176) 
has the appearance of independent particle form, even with correlation in the wave function. 
The antisymmetrization of the many-particle wave function does not affect the expectation 
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value of one-particle operators. It does affect the expectation value of two (or more)-particle 
operators. 

The tight-binding Hamiltonian may also contain the Coulomb interaction between 
electrons. In its most general form (1.157), the interaction term is 


1 
7 2 V kimn ciel, CnC] (1 -l 77) 


x e 
Vann = [er Jeno ti — R,)o(r, - R) —— 
lr; -r 


x o*(r, _ R,, )0(r, ~~ R,) (1.178) 


The four orbitals could be centered on four different sites. These are called four-center 
integrals. They are usually small and nearly always neglected in many-body calculations. The 
terms which are included are just the largest Coulomb terms. One possible term is the direct 
interaction between two particles on different atomic sites. For example, setting k = / and 
as 


joo ROP 16" (2 — Rw (1.180) 


Note the operators can be rearranged into n,n,, as long as k Æ m, so that one does not have a 
particle interacting with itself. At large separation the interaction terms V,,, become just a 
Coulomb potential, 


e? 


ROM o Vien > RR] (1.181) 
if the orbitals have s-symmetry. A Hamiltonian with these nearest neighbor Coulomb inter- 
actions is similar to the lattice gas Hamiltonians introduced in Sec. 1.4. 

The Hubbard model (1963) retains only the Coulomb integral which is the very largest. 
All four orbitals (r) are centered on the same site m. This term describes the interaction 
between two electrons which are on the same atom. Since two electrons cannot be in the same 
state, the two on the same atom must be in different atomic states. In the simplest model, 
which considers only a single orbital state on each atom, the two electrons must have different 
spin configurations. One has spin up, while the other has spin down. The Hubbard model 
considers the following Hamiltonian: 


H = we cls ojo + Mod CfoCjn + U E nyn (1.182) 
jo ; 
U = Vi = Jar [noe z SED (1.183) 


The hopping term is usually limited to nearest neighbors. This Hamiltonian was also intro- 
duced by Gutzwiller (1963), who studied the properties of electrons in d-bands in ferro- 
magnets. It was then extensively studied by Hubbard (1963—1966). It is thought to be a good 
model for electron conduction in narrow band materials, for example, in transition metal 
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oxides (Adler, 1967). The Hubbard model has been investigated thoroughly over the past 
forty years, and its properties are starting to be understood. They are discussed in Chapter 6. 

The parameter U is the Coulomb interaction between two electrons on the same atom. 
Usually the model is applied to tightly bound orbitals such as d- or f-states. Then U is quite 
large, perhaps 6 eV. The bandwidth Zw is sometimes taken to be smaller. However, some of 
the most interesting phenomena seem to occur for U ~ Zw. The Hubbard model can be 
solved exactly only in one dimension, as shown by Lieb and Wu (1968). But there are two 
limiting cases where exact solutions can be obtained in other dimensions. One is where 
U = 0, which is the nearest neighbor tight-binding model. The other case is where the 
hopping bandwidth w = 0. It is the atomic limit, since here each atom is considered indi- 
vidually, independently of the others. The energy in this atomic limit is 


E = WN, +nU (1.184) 


where N, is the number of electrons and n is the number of sites with two electrons. 

Creation and destruction operators are used to describe other kinds of operators besides 
Hamiltonians. We shall frequently use density and current operators. The density operator 1s 
summed over the position of all particles: 


p(r) = 2 o(r — r,) (1.185) 
j 


p(r) may be expressed in terms of creation and destruction operators as 
ptr) = WiC) = 5 Chon iO (0) (1.186) 
The Fourier transform of the density operator is also needed: 
pq) = ja rei" p(r) = 2 Chey [Erio e'ar (1.187) 
7 
The two most popular representations are the free-particle model: 
p(q) = > Chig.oCke (1.188) 


and the tight-binding model when omitting overlap between neighbors: 


p(q) = n(q) E net (1.189) 
jo 
n(q) = |a (1.190) 


Another important operator is the electrical current. It is the summation over all particles and 
their velocities: 


iw) =4 2 elv — r;) + 8(r — rv] (1.191) 
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The summation above is over different groups of particles. Each group of particles contains 
identical particles with the same charge e,. Let j; be the particle current for each kind of 
particle species. Then the electrical current operator is 


jr) = 2 ejr) (1.192) 


in) = 5 WIV) — VOTH") (1.193) 


where one sums over each particle species i and the particle current j; for each species. The 
second equation is the standard quantum mechanical representation for the particle current. 
The Fourier transform of the current operator has the form 


e l —iq rra * * 
O OAO O O (1.194) 
n 
For free particles, this expression has the form 
, 1 
1D) = OK +3 Dekra ocko (1.195) 


Another case of interest is the current operator for the tight binding model. It is easier to 
consider an alternate, but equivalent, formula for the current operator. The derivation starts 
from the definition of the polarization operator in terms of the particle density p(r), 


P = | row (1.196) 


which is a summation over all the particles and their positions. One then recalls that the time 


derivative of the polarization is just the particle current: 
0 0 
~P = | @rr— 1.197 
GP = | ares te. (1.197) 


This relationship can be proved easily by using the equation of continuity, followed by an 
integration by parts, 


p(r, t) = -V : j(r, 2) (1.198) 
<P -= [env jí, ) = | Pri) . Vr (1.199) 
- l Bri(r, t) (1.200) 


In the tight-binding model, the polarization operator has the form 


P= Rp», (1.201) 


where R; is the position of a lattice site, and n; is the number of particles at that site. The time 
derivative is 


ð 
j = ZP = ilH, P] (1.202) 
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For example, if the Hamiltonian has the form 
H = wEc Cig ojo t3 JÈ Nighy V; (1.203) 


then only the first term contributes to the current operator. The other term contains only the 
position operator n,,, which commutes with itself. Here s is the spin index. In this case the 
current operator is 


is? 


= = -iw L öc Ci48,0Cj0 (1.204) 


This current operator is used in calculations on localized electrons. It applies to organic solids 
and narrow band ionic solids. 

The energy current operator is needed to calculate energy transport in solids, which 
occurs, for example, in discussions of thermal conductivity or thermoelectric effects. Energy 
currents flow whenever heat is generated or dissipated nonuniformly in the solid. The energy 
current j, is defined as the energy flow through a surface. It obeys an equation of energy 
conservation, 

ð , 
ati tV de =0 (1.205) 
where the energy change dH /dt equals the variation in the energy flux. An equation for the 
energy current may be derived by formally introducing an operator which is the integral over 
the position and Hamiltonian density: 


Rg =} | @r[rH(r) + H@)r] (1.206) 


The equation of energy continuity (1.205) may be used to show that the time derivative of this 
quantity is just the energy current: 


0 


Ifa fð 3 
R= fa [rèno + 5 Meo =i, (1.207) 


For a free-particle system, the energy current is 


jz = > Vp£pChoCpo (1.208) 


This result is sensible. It is just the energy Ep = p?/2m of each particle multiplied by its 
velocity Vp = p/m. In the nearest neighbor tight-binding model, one writes Rz in terms of the 
site Hamiltonian A; and position R;;: 


Rz = RA; (1.209) 
H= hj (1.210) 
jz = HR, hj] (1.211) 


Ww 
h; = 2 S (el 5 ocio + cl, Ci+8, o) +5 24 > Nish; s Vi (1.212) 
50 
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For example, the current operator from just the hopping term for electrons is in one 
dimension: 


Je = 


w 
I> D (8 + ò’) c) gya Ci (1.213) 
i55's 


The other terms can become quite complicated, and will be introduced only as they are 
needed. 

There are several other comments. First, the operators P and Rg, are not defined in 
infinite systems, since the integral over position will diverge. One can devise an alternate 
definition 


P=—ilimV, [drome (1.214) 
q—>0 


R; = —i lim Va [öne H(r)} (1.215) 
q> 


The second comment concerns the energy current. As discussed extensively in Sec. 3.8, the 
energy current is often not the current which describes thermal conductivity or thermoelectric 
power. In metals, it is customary to use the heat current, which is defined as 


jọ = jz — yj (1.216) 


is given in terms of operators jp and j, which have already been defined. 

Another observation is that these operator definitions are the same for all particles, 
regardless of whether they are bosons or fermions. Of course, spinless bosons do not have the 
summation over spin index o. Otherwise, everything is the same. However, calculations using 
these operators depend significantly on whether the particles are bosons or fermions. 


1.3. ELECTRON-PHONON INTERACTIONS 


The first two sections described the Hamiltonians for phonons and electrons, respec- 
tively. This section discusses their mutual interaction. This topic is important in many-body 
theory. The electron-phonon interaction causes superconductivity in many metals and 
influences the transport properties of every metal. In pure semiconducting and ionic solids, 
the electron—phonon interaction usually dominates the transport properties. The word polaron 
is used to describe a single electron which is coupled to phonons. The modern formulation of 
the polaron problem was due to Frohlich et al. (1950), and its study is an important part of the 
history of many-body theory. 
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1.3.1. Interaction Hamiltonian 


The basic Hamiltonian is assumed to have the form 


H =H, +H, +H, (1.217) 
H=} Dga dq (1.218) 

q 

2 enl 
Pi 

H, = — +>) — 1.219 
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The atom part H, describes the normal modes of vibration of the solid and is the phonon 
Hamiltonian of Sec. 1.1. The second term is the electron part H,, which was discussed in Sec. 
1.2. The third part is the electron—ion interaction. It is assumed that H,; is the summation of 
the interaction between the individual atoms at R; and the electrons at r;: The word ion is not 
meant to imply a particular charge state. In metals, the atoms are ions, while in covalently 
bonded semiconductors they are something else. The word ion is used to encompass all these 
possibilities. Each ion is at a position R; = = RO +Q,, which is the sum of the equilibrium 
position RO and the displacement Q,. The displacements are usually small, so that one can 


expand in ‘powers of them: 
Veit; — RP — Q) = Valt; — RP) — Q; + VVar; — Ry) + OQ’) 


The linear electron-phonon interaction term is obtained from the first term in Q;. The terms in 
O(Q7) are neglected here, although they are retained in some circumstances. The constant 
term 


2 V(r; — Ry”) (1.221) 


is the potential function for the electrons when the atoms are in their equilibrium positions, 
which forms a periodic potential in a crystal. The solution of the Hamiltonian for electron 
motion in this periodic potential gives the Bloch states of the solids. They are usually assumed 
to be known. Many problems begin by writing the Hamiltonian as H = H) + V, where Ho is 
a Hamiltonian which can be solved and V is the perturbation. Quite often the eigenstates of 
Hy are just the Bloch states of the solid, calculated by assuming that the atoms are in their 
equilibrium positions. The sequence of approximations we are making is called, collectively, 
the Born—Oppenheimer approximation. As always, there are circumstances where these 
approximations are inadequate, and other approaches are necessary, e.g., in the dynamical 
Jahn-Teller effect. 
The electron—phonon interaction is 


V(t) = EQ; Val — Rj”) (1.222) 
J 
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This interaction is to be written in terms of operators. It is assumed that the electron—atom 
potential possesses a Fourier transform: 


1 , 
Valt) ==> Valet" (1.223) 
q 
Walt) = E E Valade! (1.224) 
q 
Evaluate the combination 
i i. ia RO 
Volt) = EE Valde "q + (x Qe” ) (1.225) 
q j 


The earlier definition (1.85) of Q, is used to show that 


i 2 Qe R” = a E Qu. (1.226) 
P 1/2 
=-} (ra) bata laq + 44) (1.227) 


where the summation G is over the reciprocal lattice vectors of the solid. The phonon states of 
q + G are defined only within the first Brillouin zone of the solid. Here the values of q may 
be outside the zone, so that q + G projects them back into it. But the Fourier transform 
V.q + G) is defined over all values of (q + G), not just the first Brillouin zone. Write the 
interaction Hamiltonian in the form (MN = pv, p = density of solid in grams per cubic 
centimeter) 


1/2 
, h 
— Ir’ G . + 
V(r) = — 3 e (q+ V (q + G)(q + G) Eq (=) (ag + aLa) 


The summation over q is restricted to be within the first Brillouin zone of the crystal. The 
phonons are defined only in this space, so @q, dg, Ég have only q labels. But the summation 
over reciprocal lattice vectors G permits the potential V,,(q + G) to interact with higher 
Fourier components. The notation also dropped the summation over the various phonon 
modes A. Such a summation should be done, and we always do it, but writing A in every 
subscript is a nuisance. So just do the summation over modes, and skip the writing of A in 
every subscript! 

The potential V,,(r) is defined as the unscreened electron—atom potential. Later it is 
shown that electron—electron interactions in metals cause a significant reduction of this 
potential, which is called screening. The potential V,, is sometimes calculated from first 
principles, but more often it is obtained from a pseudopotential (Heine, 1970; Harrison, 
1966). Modern calculations usually use force constant models to obtain accurate phonon 
energies @,, and polarizations €, throughout the Brillouin zone, for each mode of polarization 
and use them in calculating electron—phonon properties. 


Sec. 1.3 e Electron-—Phonon Interactions 29 


The potential V,,(r) acts upon the electrons and also upon other particles such as 
positrons. The electron—phonon interaction is obtained by integrating this potential over the 
charge density of the solid p(r): 


He = | Proy 
1/2 
__ e (fh 
= 3 p(q + G)V.(q + Ga +G): 6q (; OV z) (ag tala) 


The particle density operator p(q) was defined in the prior section. Several examples of it 
were given there. Quite often this matrix element is abbreviated by the symbol 


1/2 
h 
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1.3.2. Localized Electron 


There is one problem which can be solved immediately: the electron-phonon Hamil- 
tonian when the electrons are fixed in space at positions r;. It is assumed that they cannot 
recoil, which neglects the electron kinetic energy term. This model is often applied for 
localized electrons in solids. A localized electron occurs in deep core states and in some 
impurity levels. Then the Hamiltonian has the form 


H =H, + Hg (1.230) 
1 , 
— Elos (aan + z) + 2 laq + at et" 
x X Maromla + aes" (1.231) 


The electron density operator n(q + G) is just the Fourier transform of the localized charge 
density: 


pa +G) =| dre" © hor = rf (1.232) 


= J} e" n(q +G) (1.233) 


na +6) = | Pre polr) (1.234) 
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The various interaction terms are collected into an effective matrix element: 


Fr) = 3 n(q + G)e""M +6) (1.235) 


H = Ljos (aja. + 5) +} (aq + ale" F Ar) | (1.236) 
q i 


The function F,(r) is periodic in the lattice, since increasing it by a lattice vector a does not 
change its value. It is assumed that all localized electrons in different unit cells are in the same 
position within the cell. That is, F,(1) is assumed to be the same for all localized electrons. 

This problem has been solved before. It is just the harmonic oscillator in an electric field 
[Eq. (1.38)]. Now each wave vector and polarization state q is a separate harmonic oscillator, 
which finds its own equilibrium configuration. Follow exactly the steps used to solve (1.38), 


and the creation and destruction operators are transformed to the new set 


Fi , 

A, =4,+ re ean (1.237) 

Al =a 4 M9 ian (1.238) 
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The Hamiltonian with these operators is 
H= o, |44 ii y Fal y gan (1.239) 
q ATT 2 q % i 


Furthermore, the new operators still obey the harmonic oscillator commutation relations 


[Aa Ab] = ag (1.240) 
[Aa Aq] = 0 (1.241) 
[At, Ab] = 0 (1.242) 


AL)" 

Ing) =! 4 |()) (1.243) 
ng! 

E=Se,(n +4) -g penp (1.244) 
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The eigenstates are interpreted in the same way as used for the simple spring: Each normal 
mode q has stretched to a new equilibrium configuration, 


1/2 
F , 
o? = -2/ f ) Tay eiA, (1.245) 
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and now oscillates about this new equilibrium point. The oscillation frequencies do not 
change. The last term in (1.244) is the relaxation energy. It is the potential energy gained by 
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stretching the springs—the phonon normal modes—to the new equilibrium positions. This 
energy term may be expanded in terms of the electron coordinates. The relaxation energy is 


F: , 1 
—_— y | a Y gai) — _ `> V(r; — r;) (1.246) 
q % ï 2i 
1 
i >J 
Fal? Bq Fal; 
V(r) = —2 > eT = —20 | de'a (1.248) 
R ao, Jem om 


Note that the summation over phonon polarizations has been added to the last expression, 
which is the integral one has to evaluate. 

The relaxation energy consists of two types of terms. The first is Vp(0), which is the 
relaxation energy of a single particle by itself. This energy is caused by the electron inducing 
a static polarization in the phonon field which acts back upon the electron. It is a self-energy 
effect. It shall be called a polaron self-energy or electron self-energy. The energy is not just 
with the electron. As with the stretched spring, it is in the combined particle—oscillator 
system. 

The other type of term is the interaction between pairs of fixed particles Vp(r; — r;). Here 
the physical picture is that one particle polarizes the medium, and this polarization field 
changes the energy of other particles which are nearby. This potential Vp(r) has different r 
dependence for different types of phonons in solids. In some cases, it is very short-ranged, so 
that two particles interact only when they are in the same unit cell of the crystal. In other 
cases, the potential falls as slowly such as r~', as if it were a Coulomb potential. Several of 
these cases will be presented later. 

The same many-body problem is discussed again in Chapter 4. The harmonic oscillator 
in a linear potential is an important model, if only because there are so few models which can 
be solved exactly. It is also the solution to the independent boson model. 


1.3.3. Deformation Potential 


In semiconductors and ionic solids, the excited electron states are usually confined to a 
small location in wave vector space. In thermal equilibrium, the excited states are at an energy 
band minimum, which is often at the zone center or edge. Usually the polaron effects for these 
electrons involve only phonons of long wavelength. In this case the tradition has been to 
parameterize the interaction rather than compute it from first principles. Most electron- 
phonon interactions in semiconductors use only three types of interactions: deformation 
potential coupling to acoustical phonons, piezoelectric coupling to acoustical phonons, and 
polar coupling to optical phonons. Another possible coupling is the deformation coupling to 
optical phonons. These interactions are valid only at long wavelength. When the electron- 
phonon matrix element is needed at short-wavelength phonons, the usual method is to 
calculate them from pseudopotentials. 

The deformation potential coupling to acoustical phonons is just the long-wavelength 
limit of (1.229). Only the G = 0 is retained, since the terms G + 0 are of wavelengths that are 
too short. The electron—ion potential is taken to be a constant (V,,(q) — D) at q — 0, where 
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D is the deformation constant. At long wavelength, & —> q, and only longitudinal phonons are 
important if the band is nondegenerate. The interaction has the form 


h 
H,, =D 
P aE 


where the summation is only over longitudinal acoustic phonons. However, valence bands in 
semiconductors are often degenerate at the band maximum. Then the hole excitations have a 
deformation coupling to transverse phonons, which is a very large polaron correction (Mahan, 
1965). 

The deformation constants are obtained by measuring how energy bands shift with 
increasing pressure on the solid. The value of D, for a band n is simply the rate of change of 
band energy with pressure (Thomas, 1961). 


1/2 
) Iqip(q)(aq + a4) (1.249) 


1.3.4. Piezoelectric Interaction 


Many semiconductors are piezoelectric. The macroscopic effect is that an electric field is 
generated when a crystal is squeezed and vice versa. Acoustical phonons, which are periodic 
density modulations, make periodic electric fields. The crystal must lack an inversion center 
to be piezoelectric. The group IV semiconductors Ge and Si are not piezoelectric. The III-V 
semiconductors such as GaAs are very weakly piezoelectric, while the II-VI materials such as 
CdS and ZnO are extremely piezoelectric. A very detailed derivation of the electron-phonon 
interaction has been given elsewhere (Mahan, 1972). Here we shall provide only a quick 
sketch. If S; is the stress on the crystal, then the electric field is proportional to the stress, 


ij 


where the matrix M;,, is a constant which gives the proportionality. The stress is defined as the 


symmetric derivative of the displacement field: 


1/80; 30; 
Si =5 (2 + 3 (1.251) 
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The electric field may be shown to be longitudinal and to point in the direction q of the 
phonon. It may be written as the gradient of a potential ¢(r) 


ð 
E; = T OW? = -5E id dye’ (1.253) 


The final result is the observation that the potential is proportional to the displacement: 


o(r) x O(r) (1.254) 
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. h ^ iq-r 
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The electron-phonon interaction for the piezoelectric interaction is 


h 
2PMgV 


1/2 
Ay, = Z( M,(9)p(q)(a, + ata) (1.256) 


The matrix element M, (q) does not depend on the magnitude of q, but it very much depends 
on its direction. It also has the property that M, (—ĝ) = —M,,(q) so that (1.249) is Hermitian. 
In fact, the piezoelectric interaction is quite anisotropic. The matrix element is also very 
dependent on the polarization à of the acoustical phonon whether it is LA (longitudinal 
acoustic) or TA (transverse acoustic). Most many-body calculations have tended to take a 
constant value for the matrix element, where this constant is obtained by averaging over the 
various angular directions in the crystal. This approximation is adopted here. 

An interesting result is obtained if (1.256) is used to calculate the effective potential 
energy between two fixed electrons: 


aq ~ My 
(2n)° T 2pm, 


Vp(r) = -2| (1.257) 


In a Debye model, the phonon energy is proportional to wave vector @g, = C,q, where the 
sound velocity c}, depends upon the mode. Since M, and p are constants, the potential Vp(r) 
is the Fourier transform of g~*. This transform was worked out earlier, in (1.163), and just 


gives a potential varying as r7!: 


2 


e 
yV = —Yy — 
R(r) Y 7 
1 oM 
= yk 1.258 
y gnep T È ( ) 


The answer is a form of Coulomb’s law. Actually, it is a law of dielectric screening. In 
dielectric materials it is known that the potential between two fixed charges is 


2 
e 
Votar) = —— (1.259) 


r&total 


where €,,,) is the total, static, dielectric constant. In the piezoelectric calculation, the piezo- 
electric contribution to this dielectric screening was derived. The total dielectric function is 
the summation of many contributions: 


Etotal = Goo + Epiezo + Epolar + Ee—e (1.260) 
Etotal = E0 + Epiezo (1.261) 


The first term £ is from interband electronic transitions. The piezoelectric contribution is the 
term under discussion. The polar and electron—electron contributions are explained later. All 
other contributions are £ọ. All of these contributions are functions of q and œ. In the present 
problem, with a static, fixed charge, these quantities are needed in the limit that q —> 0 and 
œ — 0. The electron—electron term is not finite in this limit, so probably should not have been 
included in the list (1.260). It may be omitted if there are no mobile charges present, which 
shall be assumed. 
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If there were no piezoelectric contribution, presumably the interaction potential between 
two fixed charges would be e?/(eọr). The piezoelectric contribution (1.258) represents the 
difference between the above result and total screening (1.259): 


fe (< — 1) (1.262) 


total’ r \& 


so that 


1 l 
— > (1.263) 
Eo + Epiezo E0 


These equations can be solved to obtain the piezoelectric contribution to the static dielectric 
function: 
EY 


= — 1.264 
Epiezo 1— Eoy ( ) 


The interaction term (1.258) is negative because the screening lowers the potential energy. 
The unscreened potential is just e?/r, and each bit of screening lowers it by an amount 
proportional to e?/r. 

The same acoustical phonon may interact with an electron by both the deformation and 
piezoelectric interactions. These two interactions do not interfere, to second order, because 
they are out of phase. The sum of the two interactions (1.249) and (1.256) gives 


1/2 
h _ 
H, = 3 (55) M(q)p(q)(aq + aa) (1.265) 
M(q) = D\q| + iM, (â) (1.266) 


The deformation potential is real, while the piezoelectric is imaginary. To second order, they 
do not interfere: 


IMI? = D? + M? (1.267) 


Since electron-phonon effects are usually evaluated only to second order, these two inter- 
actions can be treated separately. 


1.3.5. Polar Coupling 


The polar coupling between electrons and optical phonons can be very large in ionic 
crystals. The form of the Hamiltonian has been derived often, e.g., Fröhlich (1954). In ionic 
crystals some of the atoms are positively charged, while others are negatively charged. An 
optical phonon has the different ions in the crystal vibrating out of phase. When the plus ions 
and minus ions oscillate in the opposite direction, they set up a dipole polarization field. The 
polarization causes a long-range electric field which scatters the electrons. The electric field is 
the source of the polar coupling. 

The polar coupling is only to LO (longitudinal optical) phonons and not to TO (trans- 
verse optical) phonons (Mahan, 1972), because only the LO phonons set up strong electric 
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fields when they vibrate. These electric fields are in the direction of vibration, which at long 
wavelength is in the direction of the phonon wave vector q. For a system of no free charges, 


V-D=0= q. (E, + 4nP,)e" (1.268) 
q 


For an LO phonon mode of wave vector q, the electric field E, and polarization P, are both 
parallel to q. The formula for the electric field produced by the polarization is: 


E, = —4nP, (1.269) 
The next assumption is that the polarization is proportional to the displacement 
P, =eVQ, (1.270) 
P 1/2 
E, = —4ne UQ = -aieu (zt) Gay + ata) (1.271) 


where the coefficient U is to be determined. The phonon energy @;9 is assumed to be 
constant. Since the electric field points in the direction of q, it may be expressed by a 
potential: 


E = -Vo = -i $ "apa (1.272) 
q 
4nteU h ve 
— iqr me ^ t l 
o(r) Le 7 (2) q(aq + aq) (1.273) 


which gives the potential produced by the LO phonons. The interaction constant U still needs 
to be determined. Its value is obtained by considering the potential between two fixed elec- 
trons that was calculated from our relaxation energy (1.247): 


2 h dq eat 
V(r) = ———(4nevy’ l 1E (1.274) 
Oro 2PMq} J (2T) 4q 
The Fourier transform integral again produces a Coulomb potential: 
o2 
Vr(r) = T— (1.275) 
4 2 
pact (1.276) 
POZO 


This interaction energy is again interpreted as a contribution to the dielectric screening of the 
solid. This term represents the contribution from the optical phonons. It represents the 
difference between screening with just the electronic interband part ¢,, and the interband plus 
optical phonons £y: 


2 2 | 
fe (= — r) (1.277) 
rE r EQ 
1 1 
Ta = (1.278) 
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The unknown factor U is 


2 

po 1 1 

U= (—-=) (1.279) 
CO 


It is possible to write the electron—phonon Hamiltonian in the form 


M 
H,, = Y — pqa, + at.) (1.280) 
P aR too 
M? = 2ne*ho,0 (> — =) (1.281) 
Eoo fo 


where the matrix element M is given in the second line as M*. This form is a very sensible 
way in which to express the matrix element. The dielectric constants €) and £ are both 
measurable: £ọ is the low-frequency dielectric function measured by putting the solid between 
the parallel plates of a capacitor at low frequency, while € is the square of the refractive 
index. 

Besides these quantities, the matrix element depends only on the charge e and the LO 
phonon energy 4@,,. In spite of the elegance of this simple form, the matrix element is not 
usually expressed this way. Instead, it is customary to introduce the dimensionless polaron 


constant a, defined as 
1/2 
wae (_m_)\ (1 _1 (1.282) 
h \2horo Eon E 


Correspondingly, the interaction matrix element is 


B 4rah (A oro)” 


2m 


The reason for introducing & is simple. The self-energy expressions will have a simple and 
elegant form with this choice of a. 

Electrons in crystalline energy bands have their motion determined by an effective band 
mass m,. The band mass enters into the definition of a, which has the obvious disadvantage 
that one does not know the value of « until one knows the value of m,. 


M? (1.283) 


1.4. SPIN HAMILTONIANS 


The study of spin systems forms a very large part of many-body theory. There are many 
solids which display magnetic ordering among the electrons. There are many impurity 
problems where spin plays an important role. In attempting to explain these phenomena, 
many different types of spin models have been introduced. Some of these involve localized 
spins interacting among themselves, while others have localized spins interacting with free 
electrons. It is a subject in which there are few exactly solvable models which are nontrivial. 
Only a few of the models have solutions which are well understood, in spite of the fact that 
many of them have been intensely studied. The transference of model results to real solids 
which are strongly interacting systems has still not been very successful. 
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The commutation relations for spin one-half operators are (A = 1) 
(S, SP] = 188, 
[sP S91 = 18%, 
[S)?, SP] = iS, (1.284) 


The subscript label / or j denotes spin site. Spins on different sites, or with different electrons, 
commute. The superscripts (x), (y), (z) refer to space coordinates. These spin operators are 
often represented by Pauli spin matrices, 


salt 0 l 
2\1 0 

st 0 —i 
2\i 0 


a 1/1 0 
sO =- (1.285) 
2\0 =i 


but these representations are unnecessary for many-body calculations—the commutation 
relations are sufficient. It is customary to introduce spin raising and lowering operators: 


SP = si + is” (1.286) 
Si? = Ss — is?” (1.287) 


The names raising and lowering are applied to these operators because they raise or lower the 
magnetic quantum number m of the spin state. For spin one-half, there is only spin up |+) and 
spin down |—), and the operators go from one to the other: 


SOP |-) =+) SOl+) = I-) 
SY|4)=0, SO|-) =0 (1.288) 
By direct multiplication, for two operators on the same site: 
SHEO = S2 4 502 _ Ts@Os — SOSO] = S2 + 50 + sO 
SOSH = O2 4 902 4 iSO — SOS = s@2 4 02 _ sO (1.289) 


By subtracting these two results, the commutation relations are found among the raising and 
lowering operators: 


[s)?, SO] = 25), 
sP, $9) = s8, 


[S\?, SO] = -sS (1.290) 
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The last two commutators can be easily obtained from the direct definition of the operators; 
for example, 


ISP. S1 = SP, SPL + AS”, I= IS — PS? = s” 


Spins are neither bosons nor fermions. These commutation relations are unlike any which we 
have encountered previously. It is precisely these commutation relations which make spin 
problems so difficult. 


1.4.1. Homogeneous Spin Systems 


The Heisenberg Hamiltonian puts one spin on each site of a lattice and has the spins 
interact with a vector interaction. If the interaction is only between nearest neighbor spins, the 
Hamiltonian has the form 


H=-JYS;-S453=—J rl SiS), + SPS). + SSE 5) (1.291) 
jõ 


Again the notation j + 6 means the neighbors of site j. The Heisenberg Hamiltonian is often 
solved for spin greater than one-half or for coupling between spins which may be further 
neighbors. But the above form is the most common. The word solve means “approximately 
solve,” since it cannot be solved exactly, except in one dimension. Often the coupling constant 
in one direction, say z, is taken to be different from those in the other directions. This case is 
the anisotropic Heisenberg model: 


_ O w a e0) g0) 
oe Sis =I EIS Sj Sga +57 Sigal 
~J LS) S LES Ss (1.292) 


In this case the operators si? and Si can be arranged in any order. They commute 
since they refer to different sites. There are two limiting cases of this Hamiltonian which have 
their own names. The Ising model has J, = 0 


H; = -J £ SP S9 (1.293) 
jò 


j+ò 


It may be solved exactly in one dimension, even if one adds a magnetic field to the 
Hamiltonian: 


Ay = —Jy 2 SP Shs — Ho ES (1.294) 
J J 


where H is the magnetic field in units of joules. In two dimensions, it may be solved exactly 
without the magnetic field, as shown by Onsager (1944). Very accurate three-dimensional 
results have been obtained by a variety of techniques, including Green’s functions (Callen, 
1966) and critical point and renormalization group techniques (Domb and Green, 1972- 
1977). The XY model has only J,: 


Hy =—J, DS Srs (1.295) 
J 


It may be solved exactly only in one dimension. 
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It is conventional to write the Hamiltonian in (1.291) with the negative sign in front of 
the interaction term. Then for J > 0, the spins tend to line up all parallel, which is the 
ferromagnetic arrangement. For J < 0, the ordering has alternate spins up and down, if the 
lattice permits, which is called antiferromagnetic. 

The difficulty with solving spin problems is well illustrated by defining collective 
operators. The operators are transformed into wave vector space: 


ik-R. —ik-R. 
K = LBS, S =R KR, çH (1.296) 
— l KR. o(— ~ l ik-R, o(— 
s =R?" KRO, gl ony, ae (1.297) 


This transformation appears to be a reasonable approach. It was used successfully in solving 
fermion and boson problems. But examine the commutation relations for these operators: 


— l ik-R,—ik’-R: — 
s, S£ )] — woes k Rs, st )] 


2 , ' 
[s, SP] _ = y eR (k-k se?) (1.298) 
N} 
The operators s and so commute except on the same site, and then their commutator in 
(1.290) is 25, The right-hand side of (1.298) is not simple. It would be preferable to find 
something like 


ISP, SO] = CB (1.299) 


which would indicate that so and Se? were independent operators except for k = k’. They 
would behave just like bosons, and boson statistics would be used. Unfortunately, (1.298) 
does not have this property. One common approximation (Callen, 1966) is to replace (1.298) 
by the approximate expression 


E lo i | 
[SP SO] ~ 28, =E SP = 25y (S) (1.300) 
J 


One then tries to find a self-consistent equation for the average magnetization (S®), but this 
approach is approximate. Except in special circumstances, the operators so and SO? do not 
describe independent eigenstates of the system. This lack of collective eigenstates is the 
difficulty with solving spin systems and why innocent-looking Hamiltonians like (1.295) are 
difficult to solve. Hamiltonians with bilinear boson or fermion operators may be solved 
exactly. That is not true for Hamiltonians with bilinear spin operators. 

If the two equations (1.289) are added together, then 


SOS 4 ISH = apse? + 50) = 218 -S — s0] (1.301) 
= 2[S(S + 1) — S™] (1.302) 

For spin one-half, sv = 1/4, which gives the result 
(SP, S} =23@-H=1 (1.303) 


The S and S© operators, on the same site, obey an anticommutation relation. Anti- 
commutation relations are associated with fermion operators. The spin one-half operators 
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behave as a fermion on the same site. For fermions, each site may have zero or one particle for 
each orbital and spin state. These two possibilities correspond to the two possible spin states 
of up and down. For example, spin down is equivalent to zero particles and spin up to one 
particle on the site. Equation (1.288) is analogous to the fermion relations 


C1) = 10), C710) = |1) 
C|0)=0, CÌ) =0 (1.304) 


Unfortunately, one cannot carry this analogy too far. On different sites, the spin operators 
commute, while the fermion operators anticommute. The spin operators are not fermions 
either. 

However, in one dimension, the spin one-half operators can be made into exact fermion 
operators. This transformation was discovered by Jordan and Wigner (1928). In one 
dimension, the spins are aligned along a chain. A new set of operators is defined: the old 
raising and lowering operators are multiplied by a phase factor which is dependent on spin 
site: 


— Zid; o(-) 
dj = eS; 
t — -ip e+) 
dj =e "S 
did; = SPSO = S(S +1) - S +88 =14 89 (1.305) 


The phase factor , is chosen to be n times an operator which measures the number of spin- 
up operators to the right of that position: 


j=! j-1 
j>l:ġ=r H+S) =n did, (1.306) 
k=l k=l 
The chain is numbered from one end, say the right, with site indices j = l; 2,3,.... Set 


), = 0. The dt operators are interpreted as creating fermion particles, and d'd, d, = n; is the 
number operator for each site. The phase ọ, is 7 times the number of such fermions to the 
right of the site j. The phase factor commutes with se , Since the operator ¢, is the number to 
the right and does not involve the number operator on ‘the same site. On the same site, these 
operators anticommute, 


(d, dj} ={S, SP} =1 (1.307) 


since that is the property of the spin operators themselves. On different sites, they also 
anticommute, with the help of this new phase factor. By taking the anticommutator, 


(dy, di} = eS etn S 4. ibn SH gibi gs) (1.308) 


Since / Æ m, assume that / > m. The phase factor @,, then commutes with st but p; does 
not commute with SS). The anticommutator is then 


(dy, dh} = el m[sO SH 4. enS en I) (1.309) 
The right-hand term contains the operator combination 


eo iM(1/2+5) SH gin(1/24S) _ gin (+) — sH (1.310) 
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The S operator must always raise the magnetic quantum number m by unity, so that S® on 
the left always measures one integer higher value than the same operator on the right of S™. 
One gets an extra phase factor of in, which changes the sign of the term. This phase factor Q; 
was chosen to produce this sign change. The anticommutator of the d operators is now 


{d,,d'}=0 forl>m (1.311) 


It is equal to the commutator of the S, SC operators, for different sites, which is zero. The 
d operators anticommute for both the same site and different sites. They are pure fermion 
operators and obey fermion statistics. In one dimension the XY model (1.295) may be 
transformed into a Hamiltonian in terms of the d operators, 


= FSS + SPS?) (1.312) 
= -J Y (dadie md, + idle t-d) (1.313) 
J 
= Edje dy +d e d) (1.314) 
Ha = -J Yd dy + did) (1.315) 


which is just the tight-binding model for fermions. The phase factors vanish because in the 


first term n, is zero if it precedes a raising operator, and in the second term it is zero if it 


follows a lowering operator. Next the Hamiltonian is changed to collective coordinates: 


1 , 
d, =—— 5 e “id, d= e"id, 
spe aey 
1 , 
d! =Y edt, ed! 
a aa 
H=-2J 2 ydd, Yı = cos(ka) (1.316) 


and the collective operators now obey anticommutation relations as well: 
l . 1 , 
{d;, dj.) =>), eR ER) id, dt} = — ll) x By, (1.317) 
Nj N‘ 


The Hamiltonian may be written as a simple fermion problem. The exact partition function is 
Z = Tre” = TI (1 + e” $) (1.318) 


Note that there is no chemical potential for spin systems. One can also work out other 
properties of this model. To prove that the transformation (1.305) is valid, one should show 
that all the commutation relations (1.290) are preserved. They are. 

The Jordan-Wigner transformation shows that in one dimension the spin one-half 
operators may be represented exactly as fermions. This result is not valid for higher 
dimensions. No one has been able to find an equivalent transformation for two or three 
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dimensions. Indeed, most approximate analyses assume that in two or three dimensions the 
spin excitations behave approximately as bosons rather than as fermions. 

In two and three dimensions, one may still transform spin one-half operators into particle 
operators. In this case, the particle operators have funny commutation relations. They are, like 
spins, neither fermions nor bosons. This transformation associates the creation operator with 
S and a destruction operator with SO: 


Cc) = sS 
C, = Ss? (1.319) 
n = CEC, = SMSO =14 sO (1.320) 


This transformation preserves all the commutation relations (1.290). For example, one has 
that 


{C,, Ch} =1 (1.321) 
[C,, Ch] = -2G — 1))8in (1.322) 
[C;, n} = [SO, s®] =C; (1.323) 


The particle operators C; and CÌ, anticommute if they are on the same site and commute if 
they are on different sites. They are neither fermions nor bosons. Collective operators such as 


C, = E ae (1.324) 


have funny commutation relations, similar to the spin case in (1.298). Nevertheless, this 
Hamiltonian is a popular many-body model for certain systems. These are lattice gas models 
for atoms on lattices. The atoms may be considered as classical particles, which commute on 
different sites. However, there may not be more than one atom on each site, since the atoms 
are large, substantial objects. “No more than one atom on each site” is an exclusion principle, 
which is represented by the anticommutation relations on the same site. The same physics is 
contained in a model which has the particle obey purely boson statistics but with the provision 
that there is a strong repulsive interaction U if two particles were on the same atomic site. The 
lattice gas results would be obtained in the limit U —> oo. The statistics of anticommutation 
relations on the same site merely represent the strong repulsive interaction between atoms at 
close separation. 

The usual lattice gas (LG) model has pairwise interactions U between particles in 
nearest neighbor positions. A chemical potential p is also introduced for the particles, since 
they may have variable concentration: 


Kıs = Aye — = ZU dann M45 — bon, (1.325) 
j 
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The factor of one-half is due to counting each pair twice in the summation over all (75). The 
lattice gas Hamiltonian may be transformed into an equivalent magnetic problem by inverting 
the transformation of (1.319): 


1 l l l 
K = -U SO + s — (z) _ 
LG = 5 Ds +5 )(s +5 +5 22 S +5 (1.326) 
l UZ 
Kig = 5 UES Sas + (= -u) Ls? — mE 7 n) (1.327) 


where Z is the coordination number, i.e., the number of nearest neighbors. The spin version 
of K;g 1s identical to the Ising model with magnetic field. The magnetic field is 
Hy = UZ/2 — wp. If exactly one-half of the sites of the lattice gas are occupied, then one has 
u = UZ/2 so that the effective magnetic field is zero (Hill, 1956). In this case, for a half- 
filled band, the chemical potential is temperature independent. For concentrations other than 
one-half, the chemical potential varies with temperature. 

The quantum lattice gas (QLG) model adds a nearest neighbor hopping term to the 
lattice gas model: 


l 
Kore = Aorg — uN = J U 2- Mjjys + wÈ C!C — uJ n; (1.328) 
J J J 


One can show that the equivalent magnetic problem is the anisotropic Heisenberg model with 
magnetic field. The quantum lattice gas was suggested by Matsubara and Matsuda (1956) as a 
model for quantum fluids such as “He. The superfluid transformation for this system occurs in 
the liquid state. Nevertheless, a quantum lattice gas model appears to be a good description of 
its critical properties. The parameter U may be taken to be either positive or negative, 
depending on whether the nearest neighbors repulse or attract each other. The equivalent 
magnetic problems are then antiferromagnetic and ferromagnetic, respectively. 


1.4.2. Impurity Spin Models 


The models mentioned so far are for homogeneous magnetic systems. The same spin 
was on each site, and we tried to deduce the magnetic properties of the entire system. Other 
kinds of popular models are for impurity spin problems. Here the spin is an isolated impurity 
in an otherwise homogeneous electron gas. One can study, for example, the conditions for the 
formation of a local moment on the impurity or the scattering properties of the free electrons 
from the localized spin. 

The derivation follows Kondo (1969). An impurity atom is located at the position R, and 
has a localized electron orbital ġ; (r — R,,) when the electron is on that site. Otherwise the 
electron is in a continuum state k with wave function ,(1r) and energy &,. The wave functions 
may be considered as plane waves or alternately as Bloch functions of the crystal. A 
generalized state function is the summation over all possible states, 


y(r) = 2 Pkr)Xo Cko (1.329) 


where the X, are the spin wave functions, which denote spin up X, or down X,. The wave 
functions ,(r) and ;(r — R,) are not assumed to be orthogonal. This step will be taken 
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later, since their orthogonality is a many-body problem of sorts. The Hamiltonian is taken to 
have terms such as 
2 2 
Di lve 
H= — + U(r; ~) — 1.330 
oF + Ue] +525 (1.330) 
where U(r) is the potential for each electron, which may include the impurity potential as well 


as the usual potential of the host lattice. The last term is the electron—electron interactions. 
The first terms of the Hamiltonian are evaluated for the state function, and one gets 


2 
G ny' (r) É + 73] y(r) = 2 EKC Cko + 2 EL Cle Cio 
m ko re) 
+ Use Ch Cko 
kk’o 
+ E My(ChoCta + CioCko) (1.331) 


The first two terms on the right are the unperturbed parts of the Hamiltonian for the con- 
tinuum states and the impurity. The impurity states are labeled (Lo). The third term is 


L Uw Cto Cko (1.332) 
kk’o 


which involves the scattering of the continuum functions from the impurity potential. This 
problem is simple to solve by ordinary scattering theory. The solution will be given in Sec. 
4.1. The last term is a mixing term called Họ: 


Hy = Y My(CkCro + Co Cko) (1.333) 
Oo 


The interaction Hj, is sometimes called H,, but that name is used for another contribution 
which will be discussed shortly. The mixing term describes processes whereby the electron 
hops off of the impurity and becomes a continuum state or vice versa. This term essentially 
arises from the nonorthogonality of the continuum and local wave functions. Note that the 
spin direction is maintained during this hybridization process. A Hamiltonian of the type 


H = X &kCloCko + © ELC} Cro + L Mil Che Cts + C} Cyc) (1.334) 
ko (eJ o 


will be called a Fano—Anderson model, since it was introduced simultaneously by Fano 
(1961) and Anderson (1961). It should not be confused with the famous Anderson model, 
which will be described below. The Fano—Anderson model may be solved exactly, and this 
solution will be presented in Sec. 4.2. There is no real conceptual difference between (1.331) 
and (1.334). If the wave functions (r) are chosen to be eigenstates of the Hamiltonian 
which includes the impurity scattering U,,, then the two Hamiltonians become identical. 

The interesting magnetic phenomenon comes from the terms involving electron—electron 
interactions. Consider 


e? 


Ir, — r| 


Wa WO y ryle) (1.335) 


[dnar 
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Many types of terms are generated by this expression. Only two of them are discussed here. 
The first contains operators from two continuum functions and two localized wave functions: 

CÌ Cro Chon Cron (1.336) 


The four spin operators must occur in pairs. One possible pairing (o = 0’, o” = 0”) gives 
the interaction 


> Vix Ch. Cre ci Cko’ (1 .337) 
kk’oo’ 
Vw =e O [A l$l RP (1.338) 


This term is usually ignored, since it does not cause magnetic phenomena. It states that the 
conduction electrons interact with the impurity in a different way when the localized orbital is 
occupied. It is quite a reasonable term, since the electron is charged and one expects that the 
presence of this charge will influence the other electrons. This term is customarily ignored. 
Another possible spin arrangement is the exchange term for the above process, which has the 
form 


t f 
> J kk’ Cio Cio Cko Cko’ 
kk’oo’ 


Jie = e [Pr oxerore — R) | ry] Oe? R,,) 0, (r2) (1.339) 


In this term, the electron which is scattering may change its spin state during the scattering 
process. This spin change is always accompanied by an opposite spin change of the impurity 
spins, so that the total spin angular momentum is conserved during the process. The spin 
conservation is illustrated by writing out in detail the terms which can occur: 


Cha Cun Cha Cap + Ch, Ce, Ch Cay + Ch, Ces Ch Cry + CE, Ce, Ch Cry 
The first and last terms are regrouped as 
(Chg Cren + Chy Ce Cry Cap + ChCh) 
The first term in (1.340) contains the factor 


Ch. Cry + Ch Cu = E Cls Cro (1.341) 
Oo 
which is always unity if the localized state is occupied. In fact this first term has the form 


> Sx’ Ct Cro Clo Cpo (1 .342) 


k'o’ 


A 


which is exactly the same as (1.337). The usual convention is to combine these terms, and 
then ignore both of them. The second term in (1.340) has the combination 


SP =C} Cr — C, Cy) (1.343) 
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which is just the z component of the localized spin. Similarly, some of the other combinations 
are identified as the raising and lowering operators for the localized spin: 


SP = C$ Ch (1.344) 
S° = cC (1.345) 
L LY <Lt 


The spin-dependent terms in (1.339) may be collected as 


Ha = — Yo Sac (SP (Ch Cag — C, Cy) + SIP CE, Cer + SCE, Qe] 
kk’ 


This interaction is called H,,. It is a model of the localized d-electrons interacting with the s- 
like continuum wave functions. This form of the interaction is valid even if the localized spin 
has S' > L, The last two terms flip the spin of the continuum electron while flipping the 
localized spin of the impurity in the opposite direction. The first term does not flip spin, but 
the interaction does depend on the z-component of the spin of both the impurity and con- 
tinuum wave function. A Hamiltonian of the type 


H = 2 Ec ChgCko + € (Mp4 + zy) + Hoa (1.346) 
oO 


is called the Kondo problem. It was not formulated by Kondo (1964) but rather much earlier 
by Zener (1951). Kondo’s contribution was recognizing that the spin-flip scattering processes 
could cause unusual low-temperature behavior in the scattering properties. These low- 
temperature anomalies had long been observed in resistivities (Gerritsen and Linde, 1951, 
1954). They are derived in Chapter 6. Another type of term which may arise from the 
electron—electron interaction (1.335) has four local operators: 


UC] Cl Cio" Cho” (1.347) 


o2 

U = [Prd roldutr — R,)/° Ir, r] lb, (r. — R,)/ (1.348) 
1— 1 

If the orbital is nondegenerate, as in an s-orbital, then two electrons can be on the same site 

only in opposite spin states. The spin indices must pair up as (o” = o, o” = 0’). One gets a 

term similar to the one in the Hubbard model, 


Unityn, (1.349) 


although it was historically introduced first in the Anderson model. This famous model 
considers the model Hamiltonian 


H = » Cho Cko + EL 2 CÌ Ciro 
ko o 
+ Z Mk(ChoCe + CÌ Cko) + Un nz, (1.350) 
oO 


The Hamiltonian is exactly solvable without the last term. The last term is very important, 
since it causes magnetic instabilities in some circumstances. The Hamiltonian describes the 
interaction of the conduction electrons with a single impurity at site R,. The index L denotes 
the orbital state of the impurity. If there is only one important orbital state, as is the case for s- 
states, then the indices are often dropped from the notation. 
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The Anderson model and the Kondo model both describe the interaction of a continuum 
electron with a localized one. The two models are not totally different. There is a canonical 
transformation which, when applied to the Anderson model, will transform it into a form 
similar to the Kondo model. This transformation on the Anderson model produces quite a few 
terms, of which the Kondo model is a subset. The transformation does not produce exactly 
the Kondo model, and the two models are not identical. Write the Anderson model as 
(CL, = = C;, ny = cic) 


H =H) + Hy (1.351) 

Ho = 3 £4. Cho Cus + €r X ChoCro + Ungni (1.352) 
(oJ (6J 

Hy = Z Mk (CkoCro + C} Cko) (1.353) 


then a canonical transformation (Schrieffer and Wolff, 1966) is 
H = He = Hy + Hy + [S, Ho] + [S, Hy] + 31S, [S, Holl 
+518, [S, Hull + +-+: (1.354) 


The function S is chosen so that all terms are eliminated which are linear in M,. It is 
accomplished by choosing S to be 


0 = Ay + [S, Ho] (1.355) 
and then the canonical transformation produces the series 
H = Hy +2[S, Hy] + +S, [S, Hyl] +- (1.356) 


There are an infinite number of terms in the series. Only those which are proportional to MŽ 
will be evaluated here, which is the second term in the series (1.356). The transformation 
factor S turns out to be 


l—n 


S = =m ( 4s —\ClGy- Ch C.) (1.357) 


E, — Ek EL +tU—e 
so that one gets the commutators 


n_ 
z e, SFU Te 


|s. E ewro | = om -)da + ci, Cs EK 
ko 


n 


l Ls n 
[sagn] = -a pml + Gg wL 


l—n n_ 


—S 


Ep — Ek 1U 


[S, Unn ]=-UZM,( -Not Ch C,)n_, 


When these are added, one exactly satisfies (1.355). Although the form of S appears 
unwieldly, it gets the job done. The next step is to take the commutator [S, Hy] to generate the 
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terms which are proportional to M?. As a preliminary step, define the following effective 
exchange constant: 


l 1 l 1 
Jig = M My + ——_ —- ——__  - ———] 
Ez — Ek EE, -—Se € +U-gE €+U-6e, 


In terms of this constant, one finds the following for the commutator: 
[S, Hy] = E Jiws Chs Cws + Ci, Cr-sC}sC; — Sie nss 
kk’s 


M? 


+4(C}_ CE CC, + h.c.)] +20 
1 1 
— ŞC Co M.M 1.358 
2 ks “k s+*k (t) ( ) 


The second term flips the spin of the electron and impurity electron while scattering—it has 
exactly the Kondo form. There are many other terms. The last term is potential scattering of 
the continuum electron, which must be added to our earlier result (1.331). There is a term 
which is the non-spin-flip interaction between a continuum electron and an impurity electron 
and this must be added to (1.337). There is also a term which contains the interaction between 
two impurity electrons, one with spin up and the other with spin down. This renormalizes U 
in (1.349). Finally, there is a new term which has not been previously encountered. It 
describes the process 


Ci CL CC, (1.359) 
whereby two impurity electrons hop off of the impurity site to become two continuum 


electrons and vice versa. The Anderson model certainly describes a rich set of phenomena. Of 
course, many more terms are generated by the additional commutators in the series (1.356). 


1.5. PHOTONS 


Throughout this book the following type of Hamiltonian will be used for the discussion 
of the interaction of charges e, with each other and with a radiation field. The raising (ai) 
and lowering (a,,) operators are for the photons: 


l e; 
H = -Jp -ĉa + +Y oaa 1.360) 
2 Im p c ( HEr = 5 3 kK * kA kA ( 
The radiation field is represented by the vector potential 
] 27 ve i(k-r—@,t) 
-Ar t) =$ (—]  &.k (aye k” + hic.) (1.361) 
C kr \VOk H 
I ik-r 
= —)_ e" A,(k, t . 
Fae’ u(k, £) (1.362) 
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The term >> Okai Ayn represents its unperturbed photon Hamiltonian in the absence of 
charges. The unit polarization vector is ¢,. One feature of this Hamiltonian is the term 
l eje; 


~ ae 1.363 
2% -T (1.363) 


which is the Coulomb interaction between charges. The Coulomb interaction is instantaneous 
in time, since the potential has no retardation, or speed of light, built into it. The lack of 
retardation is not an approximation but is rigorously correct in the Coulomb gauge. Of course, 
there is retardation in the total interaction, which arises through the vector potential fields. 


1.5.1. Gauges 


Although the topic of gauges is treated correctly in a number of texts, it still seems to be 
poorly understood by students. It seems appropriate to start at the beginning and reproduce 
some standard material. Maxwell equations are 


V-B=0 (1.364) 
1 3B 
VxE=--— (1.365) 
c ot 
V-E=4np (1.366) 
13E 4 
VxB=- 4; (1.367) 
c ot C 


The charge density is p(r, t) and the current density is j(r, £). There is an important theorem 
that any vector function of position can be written as the sum of two terms: one is the gradient 
of a potential, and the other is the curl of a vector: 


S(r) = Vg + V x m(r) = S, + S, (1.368) 
S; = Vg (1.369) 
S=Vxm (1.370) 


The term S; is called the longitudinal part of S, and S, is called the transverse part. If B(r) is 
assumed to have this form, then Eq. (1.364) becomes 


V-(Vg+VxA)=V’*g =0 (1.371) 
Usually g = 0, so that the vector potential is defined as 
B=VxA (1.372) 


However, this expression does not uniquely define A(r). The definition B = V x A is put into 
(1.365): 
V x E424 = 0 (1.373 
cat | 373) 


Now the factor in brackets is also the sum of a longitudinal and transverse part: 


Vx[VWW+VxM]=0=Vx(VxM) (1.374) 
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The equation is satisfied if M = 0, so that for the electric field is 


E = —--— -Vy (1.375) 


where \ is the scalar potential. When these two forms for B(r) and E(r) are put into (1.366) 
and (1.367), the equations for the scalar and vector potentials are: 


lod 

2 

~_V.A=-4 1.376 

V Yzy TP ( ) 
1 & ð An. 


At first it appears that there are four equations for the four unknowns (A,, A,, 4z, Y): Eq. 
(1.376) and the three vector components of (1.377). If this assertion were true, the four 
unknown functions would be determined uniquely in terms of the sources (j, p). However, 
these four equations are not linearly independent; only three of them are independent. The 
linear dependence is shown by operating on (1.376) by (1/c)(0/dt) and on (1.377) by V. Then 
subtract the two equations and find 


Vv xa =-—(Pev-j) (1.378) 


The left-hand side is zero because it is the gradient of a curl. The right-hand side vanishes 
since it is the equation of continuity. The four equations are not independent. There are only 
three equations for the four unknowns. 

Therefore, the four unknown functions (A,, A,, Az, Y) are not uniquely determined. It is 
necessary to stipulate one additional condition, or constraint, on their values. It is called the 
gauge condition. The imposed condition is that the Coulomb field (r, t) shall act instan- 
taneously, which is accomplished by insisting that 


V-A=0 (1.379) 


Equation (1.379) defines the Coulomb gauge, sometimes called the transverse gauge. The 
latter name arises because (1.379) implies that V - A; = 0, so that A is purely transverse. One 
should realize that any arbitrary constraint may be imposed as long as one can satisfy (1.376) 
and (1.377). As long as these two equations are satisfied, one always obtains the same value 
for E(r) and B(r). The arbitrary choice of gauge does not alter the final value of observable 
quantities. 

In the Coulomb gauge Eq. (1.376) simplifies to 


VW = —4np (1.380) 
which is easily solved to give 
, P(t’, t) 
TODE |ë, roy (1.381) 


The potential W(r, t) is instantaneous and is not retarded. This result is not an approximation 
but is an exact consequence for our choice of gauge. Later it is shown that a different choice 
of gauge leads to a retarded scalar potential. 

Next the other equation (1.377) is evaluated. The following identity is useful: 


V x (V x A) = —V°A + VW(V-A) (1.382) 
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The second term vanishes in the Coulomb gauge, which gives 


1 8% An ow 
VA-——A=-—j+V— 
c? af? c j+ côt 
It is useful to operate a bit on the second term on the right-hand side. Using (1.381), this term 


iS 


(1.383) 


y?l _ Ly far , (0/dt)p(r", t) (1.384) 
cot c Ir — r'| 
ow l 3_/ l fers 
——L > V. l. 
Va sv fa r ror). jr, t) (1.385) 


where the last identity uses the equation of continuity. Integrate by parts in the last term, 


wy _! 


cot 


l 
v [erie t)-V’ (1.386) 


-r'| 


and then pull the gradient out by letting it operate on r instead of r’; the latter step requires a 
sign change. Also operate on the current term itself in (1.383) by using the identity 


r’, 
Anj(r, t) = —V? [ar ey H (1.387) 
By combining these results 
1? l KUDI 
2 2 3 
A-=—A= — d'r 1.388 
V c? Of? V -w7 c l Ir-r] ( ) 
Finally, using the identity (1.382), 
1 æ l jr’, t) 
2 3_/ 
—-—=—A=-- l. 
VA 2 ap mal x lv x |ë, r 2] (1.389) 


The point of this exercise is that the right-hand side of (1.389) is now a transverse vector, 
since it is the curl of something. The current is written as a longitudinal plus a transverse part, 


j=j +i, (1.390) 
Then the vector potential obeys the equation 
1 & An 
2 . 


where j,(r, t) is defined by the right-hand side of (1.389). The final equation for A is very 
reasonable. Since the vector potential A is purely transverse, it should respond only to the 
transverse part of the current. If it were to respond to the longitudinal part of the current, it 
would develop a longitudinal part. The longitudinal component of A does not occur in the 
Coulomb gauge. 

As a simple example, consider a current of the form 


jr) = Ine** (1.392) 
whose transverse part is 
i . 
i, = -7K x kx Jo)e™" (1.393) 
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FIGURE 1.2 


and j, is the component of j which is perpendicular to k; see Fig. 1.2. In free space, the 
transverse and longitudinal parts are just those components which are perpendicular and 
parallel to the wave vector k of the photon. However, solids are not homogeneous but 
periodic. Along major symmetry directions in the crystal, it is often true that transverse 
components are perpendicular to the wave vector. However, it is generally not true for 
arbitrary points in the Brillouin zone, even in cubic crystals. The words transverse and 
longitudinal do not necessarily mean perpendicular and parallel to k. 

Two charges interact by the sum of the two interactions: Coulomb plus photon. The net 
interaction may not have a component which is instantaneous. In fact, for a frequency- 
dependent charge density, at distances large compared to c/o, one finds that the photon part 
of the interaction produces a term —e*/r which exactly cancels the instantaneous Coulomb 
interaction. The remaining parts of the photon contribution are the net retarded interaction. 
Solid state physics is usually concerned with interaction over short distances. Then retardation 
is unimportant for most problems. In the study of the homogeneous electron gas, for example, 
the photon part is small and may be neglected. In real solids, the photon part causes some 
crystal field effects, which is an unexciting many-body effect. The main effect of retardation is 
the polariton effects at long wavelength (Hopfield, 1958). In general, the Coulomb gauge is 
chosen because the instantaneous Coulomb interaction is usually a large term which forms a 
central part of the analysis, while the photon parts are usually secondary. Like most gener- 
alizations, this one has its exceptions. 

The Coulomb gauge V - A = 0 is not the only gauge condition popular in physics. Many 
physicists use the Lorentz gauge: 


v.a +0 (1.394) 
cot 
which causes (1.376) and (1.377) to have the forms 
1? 
2 
——— W= 4 l. 
1 8 An 
2 . 
A-=~—A=-— . 
V 2 an 7 j (1.396) 


Now both the vector and scalar potentials obey the retarded wave equation. They combine to 
produce a four-vector which is invariant under a Lorentz transformation. The Lorentz 
invariance is very useful in many branches of physics, and the Lorentz gauge is used 
frequently. The Green’s functions are not given for this gauge, but they are provided in books 
on field theory. Obviously both the scalar and vector parts are retarded. 

Another gauge which is often used, but which does not have a formal name, is the 
condition that Wy = 0. The scalar potential is set equal to zero. In this case it is found that the 
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longitudinal vector potential is not zero but now plays an important role. In fact, the longi- 
tudinal part of the vector potential leads to an interaction between charges which is just the 
instantaneous Coulomb interaction. When = 0 the longitudinal part of the vector potential 
plays a role that is identical to that of the scalar potential in the Coulomb gauge. 

The Hamiltonian (1.360) is written in a gauge which has the scalar potential acting 
instantaneously, so that the Coulomb interaction is unretarded. This form of the Hamiltonian 
is consistent with either gauge V - A = 0 or W = 0. One gets a different Hamiltonian for other 
choices of gauge. 


1.5.2. Lagrangian 


So far it has been proved that the Coulomb gauge makes the vector potential transverse 
and that the scalar potential acts instantaneously. Next it is shown that the Hamiltonian has the 
form indicated in Eq. (1.360), which is done by starting from the following Lagrangian 
(Schiff, 1968): 


3 
L= X mv? + | [E(r)’ — B(r)’] — 3 ei [ve — Ly, ' Aac] 


This Lagrangian was chosen to produce Maxwell’s equations as well as the classical equations 
of motion for a particle at r;, with charge e; and velocity v,, in a magnetic and electric field. 
There are three variables n which are W, A, r;. In terms of these variables the electric and 
magnetic fields are: E = —A/c—Vw,B=V x A. Each variable n is used to generate an 
equation from 


doL è d òL òL 


—— — -L = 1.397 
Hh Sidr, 3Gn/ar,) SN el 
where the conjugate momentum is 
òL 
Pa == 1.398 
When the scalar potential is chosen as the variable n, 
ÒL 
— = —p(r) = — > e,d(r —r,) (1.399) 
oy 
OL E 
— Z 1.400 
o( oy / ax) 4r ( ) 
L 
Py -L 0 (1.401) 
OV 


This choice produces the Maxwell equation (1.366) from (1.397): 


V-E=4np (1.402) 
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When one component of the scalar potential such as A, is chosen as the variable in the 
Lagrangian, 


aL, 
“SA, = )x = 2 e,v;,0(r — r;) (1.403) 
ôL E, 
TE (1.404) 
òL B 


-Z 1.405 
6(0A,./dy) An l ) 
When this equation is used in Lagrange’s equation (1.397), the x-component of the Maxwell 
equation (1.367) is 


Vx B= +j (1.406) 


Choosing either A, or A, as the active variable will generate the y- and z- components of this 
equation. The Lagrangian does generate the two Maxwell equations which depend on particle 
properties. 

Another important feature is that the momentum conjugate to the scalar potential is zero 


Py =0 (1.407) 


The momentum variable conjugate to the vector potential is 
P,=-— (1.408) 


which is just proportional to the electric field. This relationship is important, since the 
quantization of the fields will require that the vector potential no longer commute with the 
electric field, since they are conjugate variables. 

The other equations generated by this Lagrangian are the equations for particle motion in 
electric and magnetic fields. Here the variable in the Lagrangian is the particle coordinate r,: 


OL Vis 

ro = ei Vale) = Ë Van) (1.409) 
ÒL e; 

öva = Pia = MV t z Ault) (1.410) 


In the last term of (1.409), repeated indices over 5 imply summation. The equation deduced 
from Lagrange’s equation is 


0 = £ | my, 4 ar) | 4 av] — Ly | ao) 


The total time derivative on the position dependence of the vector potential is interpreted as a 
hydrodynamic derivative: 


d ya ag VA 
dt ât Vi 
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Then the equation may be rearranged into 


my, = eE) + tivo .A)— v. val} = e| EE) ity x B| 
dt c c 


The last term is just the Lorentz force on a particle in a magnetic field since 
vx B=vx(V x A)=V(v-A)—-(v-V)A (1.411) 


The equation for mv is Newton’s law for a spinless particle in an electric and magnetic field. 
The Lagrangian is a suitable starting point for the quantization of the interacting system of 
particles and fields. 


1.5.3. Hamiltonian 


The Hamiltonian is derived from 
H =} Pn —L (1.412) 


where the first summation is over all of the variables and their conjugate moments. In the 
present problem this summation includes the vector potential and the particle momentum; the 
scalar potential has no momentum. The Hamiltonian has the form 


. . . 2 _ R2 
H= -Dpp -Îi Alr)| + ar] A. (cA+vv) E -7 | 


4rc c 
+ Dew) -— >| _ <i }-r‘ta | —fi | (1.413) 
fi 0 Fm4 P; C -mc P; c 
It may be collected into the form 
H == yp, -“a@)] + Deve) 
T m4 P; z i ~ i F; 


dr 1. 
+ g |e + B? —2Vy- (<A + wv) (1.414) 


Terms which are the cross product between the scalar and vector potential parts of the electric 
field always vanish after an integration by parts, 


[erv . A) = — [no .A)=0 (1.415) 


since the Coulomb gauge is used, wherein V - A = 0. The terms involving the square of the 
scalar potential may also be reduced to an instantaneous interaction between charges using 
(1.381): 


i ij Vi 


3, 3y e. 
[Fw w=- FWE |in Ees aao 
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With these simplifications, the Hamiltonian is written as 
1 e; 2 fdr 
= Elp: -Îi Alr)| +(e + B*] (1.417) 


The last terms are just the energy density of the electromagnetic fields. The first terms arise 
from the charged particles. This form of H is simple and instructive. However, our interest is 
in obtaining a quantized version. Write the electric field energy density as the separate parts of 
vector and scalar potential, 


d?r dri 1 
[Zeg À? + (VŽ += “Vy. a 


T 


-35 Eie E (1.418) 


and the scalar potential just gives the interaction between charges. 
The electric field is the conjugate momentum density of the vector potential. These two 
field variables must obey the following equal time commutation relation: 


4 (Kr, t), — l ree J = i54g5(r — 1’) (1.419) 


This commutation relation is satisfied by defining the vector potential in terms of creation and 
destruction operators in the form 


meN '/* | | | 
A,(r, t) = > ( y ) E (k, AJe™" (aye Om + a’ ee) (1.420) 
kÀ k 


There are two transverse modes, one for each transverse direction, and A is the summation 
over these two modes. The unit polarization vector €, gives the direction for each mode. The 
operators obey the commutation relations 


[aw Dy] = Sra Say’ (1.421) 
laka awx] = 0 (1.422) 
The time derivative of the vector potential is 
A,(r, t) = —i pii sa e rNe™* (ap e — a’ e~ ®t) 
The electric field is 
E (r, t) = — “A, ~ VW (1.423) 
The scalar potential is not expressed in terms of operators, since its conjugate momentum is 


zero. It does not influence the commutator in (1.419). The commutation relation (1.419) is the 
commutator of the vector potential and its time derivative: 


[A,(r, t), Aplr’, À] = 4ncidyg5(r — r’) (1.424) 
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The vector potential in (1.420) has been chosen to give just this result: 
; Ireo N 7 (2nhe2\ 
[4,(r, 1), Agr’, O] =i Yo (=) (=) 
x p OR v VO}, 
x £ (k, A)ég(k,’ AAETH EAS Say, 
= 4nicd,,5(r — r') (1.425) 
The various factors of 2m and @, which enter (1.420) are selected so that the commutation 


relation (1.419) is satisfied for the usual commutation relations for the operators a,, and a}. 
The expressions for the energy density of electric and magnetic fields have the form 


1 . how , , 
l d’r—~A*(r, t) = — > 7 K (appa pe + al at mee 
kA 


81tc2 T 
Mia, — 4 1.426 
MHA, T UA) (1.426) 
l he? | | 
[er 0 x A) = $ — (k x E) (apap 4 al at a Zita, 
ST kD 40, 
+ agai, + al a) (1.427) 


In the second term, the photon energy is , = ck. This term may be added to the first, and the 
aa and atat terms both cancel. Combining the results from (1.417) and (1.418), 


H=> l [ {aw} +52 €j; +a ala 
= FT 2m P; c ' 2il el ix KOKA KA 


This form of the Hamiltonian is just the result which was asserted in the beginning in (1.360). 
The vector potential in (1.420) is expressed in terms of the creation and destruction operators 
of the photon field. The photon states behave as bosons: as independent harmonic oscillators. 
Each photon state of wave vector k and polarization À has eigenstates pf the form 


t Ny 
ma) = L 19) (1.428) 


(ma) 


where ną, is an integer which is the number of photons in that state. The state |0} is the 
photon vacuum. The total energy in the free photon part of the Hamiltonian is 


_ I 
Fy |g, MPM +++ Pkn) = 2 MO (a, + a)IMk, ay Bho + Mk,2,,) (1.429) 


The Hamiltonian (1.360) has been derived for spinless, nonrelativistic particles. Certainly the 
most important relativistic term is the spin orbit interaction. The effects of spin also enter 
through the direct interaction of the magnetic moment with an external magnetic field. 
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1.6. PAIR DISTRIBUTION FUNCTION 


In crystalline solids, the atoms are arranged in a regular array. If R, denotes the position 
of the atoms, the summation of exp(iq-R,) over all the atoms yields zero unless q has 
particular values. These values are q = 0 or else one of the reciprocal lattice vectors G of the 
solid. In either case, the factor exp(iG - R,) = 1 so that the summation yields the number of 
atoms in the solid, which is N. This result is written as 


Le NY bys (1.430) 
J 


where q = 0 is in the term with G = 0. Many materials are not crystalline, e.g., liquids, gases, 
and disordered solids. In discussing these quantities similar summations over particle loca- 
tions are encountered. These summations still equal N for q = 0, but they must be evaluated 
for q Æ 0. This evaluation is done in terms of a function S(q), which is called the static 
structure factor or the static form factor. 

Define the density operator for the atomic locations: 


p(q) = > exp(iq - R;) (1.431) 
J 


p(q) is an operator because the R, describe the instantaneous location of each particle. It is 
assumed the particles are classical objects, so there is no need to deal with the quantum 
statistics associated with fermions or bosons. The average of this operator is defined as 


(p(q)) = | > exp(iq - R) (1.432) 
J 


where the average is taken over the various configurations of the atoms. Some of these 
averages are defined here, without explaining how to find them from first principles. The latter 
is done in standard references (Hill, 1956; Percus, 1964). 

The first average is over a single density operator: 


(p(q)) = Nõg=0 (1.433) 


(p) is zero for nonzero q because there is no restriction on the location of each atom. That is, 
in the averaging, each R, can be anywhere in the material with equal likelihood since edge 
effects are ignored. The averaging is equivalent to the integral over a continuous distribution 
of values: 


(p(q)) = S | bre = Nõg=0 (1.434) 


The integral on the right just gives a delta function at q = 0, which is the same result as 
(1.433). 
The second interesting average is over a product of two density operators: 
(p(q)p(q)) = dU (exp(ig - R;) exp(iq’ - R;)) 


ij 


(p(q)p(q’)) = N78 4 -08q—o + N8q4.q=-05(q) (1.435) 
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Of course (p(q)p(q’)) = N? when both q and q’ are zero. If both q and q’ are nonzero, the 
average vanishes unless q + q’ = 0. To prove this result, go to a center-of-mass coordinate 
system: 


Q 


Q=q+q, q=k+— (1.436) 
k=5(q-q), q= -k+ (1.437) 

where the average is 
eaa) = Zll Se- ass 


There is a strong correlation between the difference R; — R; of two particle locations, but 
there is no correlation for the center of mass R;+R,. The term exp[iQ - (R; + R,)/2] 
averages to zero unless Q = 0, which gives q = —q’ = k. Whenever q Æ 0 the average is 


(PDPC) = Sq+q'=0 2 (expliq > (R; — R;))) (1.439) 
ij 


There is strong correlation between the relative positions of two atoms at short distances. If 
one atom is at a spot, the other will not be at the same spot because atoms repulse each other 
at short separations. The static structure factor S(q) is defined as 


NBq-0 + Sq) = $ Dlexplia « (R, — R)) (1.440) 
ij 


1 
— at + 2 expliq (R; — RI) (1.441) 


One subtracts the term Nôõq-o- The right-hand side contains a summation over all particles at 
R; and then a summation over all its neighbors R; — R;. The first term has i =j and is 
independent of q. When this formula is averaged over all possible positions of the impurities, 
it should give the same result for each particle R;. The summation over R; just gives N: 


Nõo + S(q) = 1 + (= ei se) (1.442) 
AR 


The prime on the summation means that AR = 0 is excluded from the summation. The 
crucial average is the second term on the right. It averages over the relative positions AR of 
the atoms near to the one at R;. This average can also be expressed as a function of r. Define 
g(r) as the pair distribution function. It is defined as the probability that another particle is at 
r if there is already one at r = 0. The normalization is chosen to give g(r) — 1 at large 
distances. In terms of the pair distribution function, S(q) may be written as 


Nbg-0 + S(Q) = 1 +n | droes" (1.443) 
where n = N /v is the average density of the system. Also write 


Noga = 1 | re (1.444) 
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and transpose this expression to the right to give the equivalent equations 
S(q)—1 = n | dretg) = 1 


(r)—1 -ijs o TiS(q) — 1] (1.445) 
6 on (2n)° É q l 
The inverse Fourier transform may now be taken to give g(r) as a function of S(q). Both g(r) 
and S(q) go to unity at large values of their arguments. The functions [g(r)— 1] and 
[S(q) — 1] are both quantities which are short-ranged in their arguments. Their mutual Fourier 
transforms are well defined. 

The function S(q) is determined experimentally by scattering from targets. Afterwards, 
g(r) is found by a numerical Fourier transform. In liquids they depend only upon the 
magnitude of their vector arguments: g(r) and S(q). The well-known results for liquid “He are 
shown in Fig. 1.3, which is from Mozer et al. (1974). The pair distribution function g(r) 
should vanish according to the dashed line at short distances because the atoms do not 
penetrate each other at the low thermal energies of the liquid. The numerical transform shown 
by the solid line has a spurious peak for R < 2 A because of inaccuracies in S(k) for k —> ov. 

The higher moments such as (p(q)p(q’)p(q”)) would also be interesting were they 
known. But they are not easily obtainable from experiments nor from theory. 

In the model of the homogeneous electron gas, the electrons are not ordered. In this case 
one can also describe relative behavior of pairs of particles by a pair distribution function g(r). 
This quantity is always defined as the average over the motion of the particles. In a quantum 
system the diagonal density matrix is defined as the square of the N-particle wave function: 


Py(tiro +: ry) = |W(ryr, -: ry)? (1.446) 


It is normalized so that unity is obtained when integrating over all the coordinates: 


1 = [dnar e... dPrypylrir, eee rv) (1.447) 
1.8 
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FIGURE 1.3 g(R) (left) and S(k) (right) for liquid “He. S(k) is obtained by X-ray scattering measurements, and g(R) 
is found by the numerical transform. Uncertainties in S(k) at large k produce the extraneous structure in g(R) at low R, 
which is to be ignored. The dashed line shows the proper extrapolation. p = 0.1628 g/cm’, T = 2.86 K. Source: 
Mozer et al. (1974) (used with permission). 
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Since the many-particle wave function ¥ is either even or odd under the interchange of two 
identical particles, the density matrix is unchanged by this operator. Any two particles can be 
called r; and r,. Their relative behavior, after averaging over all the other particles, is 
obtained by integrating over the coordinates of all the other particles: 


P2(r1, r2) = [drar i ËPrypyltir ry) (1.448) 
0,(r,, r2) is the average two-particle density matrix. The “integration” over other particles 


included spatial integration, spin averages, etc. Similarly, the one-particle density matrix is 
that obtained by integrating over all but one of the particles: 


pi) = | Prodr, Pry Py (tyre + Ty) (1.449) 
= | Prao, r2) (1.450) 
| = [er o(r,) (1.451) 


For homogeneous systems, the two-particle density matrix p,(r;, r2) can only depend on the 
relative positions r; — r, of the two coordinates. This quantity must be related to the pair 
distribution function. They are in fact proportional. The constant of proportionality is 
determined from the condition that g(r) — 1 at large separation, so 


N(N — 1)p2(r1, r2) = n’g(r, — r3) (1.452) 


As an example of particular behavior, evaluate g(r) for electrons using the Hartree-Fock 
approximation. The many-particle wave function is just an N-dimensional Slater determinant, 
so the N-particle density matrix is just the square of this determinant: 


W071) Wi)... Wr) 


1 Wy, (r2) Wy, (r2) vee Wy, (2) 
Py(Filp -+ ry) = NI . , 7 . (1.453) 


W,, y) Y (ry) vee Wy, Ow) 


The single-particle orbitals \, (r,) are assumed to be orthogonal for different states À. A 
theorem states (see Parr, 1964) that if one integrates over all but two coordinates, the two- 
particle density matrix, from this Slater determinant, is the sum of all pairs of two-particle 
wave functions: 


2 
1 


N(N — 1),%,2! 


We) Wad) 
Y, (r2) Va, (r2) 


The summation is taken over all pairs of occupied states. The two-dimensional Slater 
determinant can be expanded to give the following equivalent result: 


P21, r2) = (1.454) 


1 
P2(r1, r2) = WNN = 1),25, V ED, C2) — Wa, ry, DP (1.455) 
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A further integration gives the one-particle density matrix as the summation over all occupied 
states of the square of the wave function: 


li 2 2 
pi) = 2 Nh, OF (1.457) 
y 


These theorems apply to any Hartree-Fock system with any orbitals. For homogeneous 
systems, it is assumed the wave functions are translationally invariant. If they are plane waves 
with spin up &, the two-particle density matrix is 


eik-r 
Y= 4 (1.458) 
Jv 
7 1 i(k—k’)-(r— 
p2(r,r) = VN Dl — eO Da by (1.459) 
kk’ 


Using (1.452) gives a pair distribution function of the form 
1 k 
ger -—-r)= z Yu — HOM a, a, (1.460) 
kk’ 


In most systems, the electrons do not all have spin up, and one also has to average over all 
spin coordinates to get g(r). This procedure is discussed in Chapter 5. 


Problems 


1. Solve the classical vibrational modes of a one-dimensional chain of atoms of type A and B. They 
alternate on the chain with masses m, and mg. The harmonic spring between atoms has spring constant 
K. 


2. Write down the Hamiltonian of Problem 1. Solve it, and show that it may be reduced to the form 
H= 3 oplat ap +4) (1.461) 

k 

where the @,, are the classical normal modes. 


3. Find the exact solution to 
H = Eata + E (atat + aa) (1.462) 


where E, and E, are constants and a and a’ are boson operators. 


4. Solve the Hamiltonian below with a canonical transformation: 
H = Eya'a+F(a+a') (1.463) 


H = eHe”, s = Ma — a’) (1.464) 
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a. Show that à = A* since H is Hermitian. 

b. Use the expansion (1.356), and show that only a few terms in the series are finite—the 
remainder vanish. 

c. Find the choice of à which reduces H to (1.50). 


5. Consider a fermion system which has three energy states with eigenvalues E£}, E>, and E}. There 
also exist matrix elements which connect these states and permit transitions between them: M,,, M53, 
and M,, 


(a) Write down the Hamiltonian for this system in terms of creation and destruction operators. 
(b) Determine the eigenvalue equation for this system. 


6. Consider a tight-binding solid which has alternate atoms of type A and B. The electron Hamiltonian 
in the nearest neighbor model has the form 


= w > (a'b +b} ṣa) + D[Bbib; + Aa;aj) (1.465) 
ið i 


where a; and b; are electron operators for atoms of type A and B. Find the exact eigenvalues of this 
Hamiltonian. 


7. Calculate the exact partition function 
eo P2 = Z = Tr e BMY) (1.466) 
for the Hubbard model (1.182) in the atomic limit. Then give the expression for the average number of 


electrons N = —(8Q,/dp). 


8. Take a three-atom chain with periodic boundary conditions. Each atom has one orbital state with 
two spin configurations. 


(a) Solve the nearest neighbor tight-binding model for zero, one, two, or three electrons with spin 
up, and give the partition function. Show that it is the same as (1.176). 
(b) Give the partition function for the XY model, and show that it is different. 


9. Give the partition function for the Hubbard model for a two-atom chain with periodic boundary 
conditions. 


10. What is the energy current operator jp for the Hubbard model (1.182)? 


11. Show that the energy current in the one-dimensional harmonic chain (1.50) is 


a, ay (1 .467) 


]2. Assume that the Hamiltonian in the tight-binding model has additional terms describing how 
phonons (b;, b’) on a site į interact with electrons n; when they are on the site. How do these terms affect 
the heat current? 


D loob; b; + Mnj(b} +b) (1.468) 
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13. Explicitly verify the equation of continuity, 


ð 
z p(r, + V >: jr, t) =0 (1.469) 
for a gas of free fermion particles which have 
H= 2 Enh Cho (1.470) 
oO 
l —iq'r 
p(r, 1) =< x eFC golko (1.471) 
qo 
. 1 iq- 
it, ) =— 2 e 9K + Laden goCko (1.472) 
qo 


14. Find the effective interaction Vp(r) between two fixed electrons (1.247) using the deformation 
potential interaction (1.249) and a Debye spectrum. 


15. Find the exact solution in one dimension and the partition function for the XY model with a 
magnetic field in the z direction. 


16. The Holstein—Primakoff transformation between spin one-half operators and bosons is 


SP =!~-adla, (1.473) 
S® = (1 —ala)"a, (1.474) 
SO =a) — ala)!” (1.475) 


where the a, are boson operators. 
(a) Use these definitions to evaluate the following commutators: 
[S SO}; SP, SO] (1.476) 


(b) Write out the Heisenberg Hamiltonian in terms of the a operators. 

(c) At low temperature, where the number of excitations ñ; < (al a,) is small, one can simplify the 
Hamiltonian by neglecting all terms of the form atataa which describe the scattering of two 
excitations. Then one can solve the Hamiltonian exactly. What are the eigenvalues? 


Chapter 2 
Green’s Functions at Zero Temperature 


Many-body calculations are often done for model systems at zero temperature. Of course, real 
experimental systems are never at zero temperature, although they are often at low 
temperature. Many quantities are not very sensitive to temperature, particularly at low 
temperature. Zero temperature calculations are useful even for describing real systems. 
Furthermore, the zero temperature property of a system is an important conceptual quantity— 
the ground state of an interacting system. A system is often described as its ground state plus 
its excitations, and the ground state may be deduced from a zero temperature calculation. 
Many zero temperature calculations have been done to deduce, for example, the ground state 
of the homogeneous electron gas or the ground state of superfluid “He. 

Some workers believe that zero temperature calculations are easier to perform. There are 
usually more terms at nonzero temperature, which is a nuisance. We do not completely share 
this viewpoint. The Matsubara methods, which are described in Chapter 3 for nonzero 
temperature, are very easy to use. We usually do zero temperature calculations by first finding 
the nonzero temperature formulas and then taking the limit of zero temperature. However, 
sometimes it is easier to do zero temperature calculations from the beginning. The zero 
temperature formalism is a necessary part of one’s calculational machinery. 

It is presumed that one is trying to solve a Hamiltonian which cannot be solved exactly. 
One does not need Green’s functions if the problem may be solved exactly. Very few exact 
results were obtained by Green’s functions which were not first obtained by conventional 
theoretical techniques. It is assumed that one is trying to deduce the properties of some 
system described by a Hamiltonian H which may not be solved exactly. The usual approach 1s 
to set 


where H is a Hamiltonian which may be solved exactly. The term V represents all the 
remaining parts of H. One tries to choose Hy so that the effects of V are small. The basic 
procedure is to start with a system completely described by Hp. The effects of V are intro- 
duced, and then calculations are done to find how V changes the properties. These steps are 
the basic procedure in many-body theory. 
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2.1. INTERACTION REPRESENTATION 


Most readers of this book should be familiar with the interaction representation. A short 
discussion of this topic will be presented anyway, as a refresher. There are three repre- 
sentations which will be discussed: Schrödinger, Heisenberg, and interaction. 


2.1.1. Schrodinger 


Elementary quantum mechanics is taught in the Schrödinger representation, which is 
based on the formula (å = 1) 


„0 

IZVO = AVO (2.2) 
which has the formal operator solution 

W(t) = e™ 40) (2.3) 


The use of this formula requires some assumptions: 


1. The wave functions y(t) are time dependent, even if this dependence is only a simple 
factor of exp(—iE?). 
2. Operators such as the Hamiltonian H are taken to be independent of time. 


2.1.2. Heisenberg 


It is possible to solve quantum mechanical problems another way which gives exactly the 
same answers yet uses methods that look quite different. The Heisenberg representation has 
the following properties: 


1. The wave functions are independent of time. 
2. The operators are time dependent, and this dependence is given by 


O(t) = 00e (2.4) 


or, equivalently, one is trying to solve the equation of motion which is derived from a time 
derivative: 


i O() =[0(0, H (2.5) 


Note that the time dependences of these two representations appear to be contrary. Yet it was 
asserted they give the same answer. To prove this identity takes too much space, so the result 
is just made plausible. 

In physics one is usually trying to evaluate matrix elements in order to determine 
transition rates. In the Schrödinger representation, the matrix element of the operator O(0) 
between two states is 


VEON) = Ob Oe OM)e,(0)) (2.6) 


In the Heisenberg representation one obtains the result 


(Yi O)OM (0) = Abt (Oe Oe" (0)) (2.7) 
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The two representations produce the same result. In order to understand this assertion, first 
recall that the Hamiltonian H will include some interaction terms. In the Heisenberg repre- 
sentation, these interactions act upon the operators, thereby changing them. When taking 
matrix elements, these changed operators are projected back upon the unchanged states in 
order to see how much the operators are changed by the interactions. On the other hand, the 
Schrödinger picture leaves the operators fixed and instead has the interactions affect the wave 
functions. Then the new wave functions are used to evaluate the matrix elements of the 
unchanged operators. The result is the same either way. 


2.1.3. Interaction 


The interaction representation is another way of doing things. Here both the wave 
functions and the operators are time dependent. The Hamiltonian is separated into two parts, 


where H, is the unperturbed part, while the V are the interactions. At the moment the exact 
form of V is left unspecified, and we merely note that it may be either, some, or all of the 
interactions discussed in Chapter 1. Usually Ho is selected as a Hamiltonian which is exactly 
solvable. 

Operators and wave functions in the interaction representation will be denoted by a caret, 
such as in V. This notation will distinguish their time dependence from that of the other 
representations: 


1. Operators have a time dependence 
O(t) = e Oet (2.9) 
2. Wave functions have a time dependence 
WO) = ete NYO) (2.10) 
It is assumed that [H), V] 4 0. If these operators do commute, the problem is usually too 


trivial to require many-body theory. When these operators do not commute, the exponentials 
cannot be combined, because 


ele = et (2.11) 
only if [4, B] = 0. 


Before going any further, let us check to see that this choice of time dependence does 
produce the same matrix elements as before: 


VEDAVA) = (yt (Oe e (elo! Oee eH, (())) 


= (y'O O(0)e",(0)) (2.12) 
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It does. The time dependence of the operators is governed by the unperturbed Hamiltonian. 
Next show that the time dependence of the wave functions is governed by the interactions 


© WO) = ie Eh — MeO) 
— —je ot Ve- ™ (0) 


TOOTO (2.13) 


This result proves the assertion that the time dependence of W(t) is determined by V(t). An 
operator was introduced into Eq. (2.10) which is defined as U(t): 


U(t) = eft! eit (2.14) 


This function has the value of unity at t = 0: U(0) = 1. Furthermore, it obeys a differential 
equation which can be written in the interaction representation: 


ð . l 
a UO) = ie (H) — Hye 
= —jelHot V (eto! et )e i 


= iV OĄUĄ (2.15) 


In order to solve this equation, one way of proceeding is by integrating both sides of the 
equation with respect to time: 


t 


U(t) — U(0) = —i | dt, V(t,)U(t,) (2.16) 


Rearranging gives (U(0) = 1) 
t 
0 

If this equation is iterated repeatedly, then 

t 


Ut) =1—i | dt V(t,) + (-iy’ | dt, | dh (ty) V(t) +++ 


0 
= 5 (—i)" | dt, | dtp ++ |" dt,V(t,) V(t)... V(t,) (2.18) 
n=-0 0 0 0 


At this point it is convenient to introduce the time-ordering operator T, which should 
not be confused with the temperature. The T operator acts upon a group of time-dependent 
operators, 


TV (4) V(t.) V)] (2.19) 


and is just an instruction to arrange the operators with the earliest times to the right. For 
example, 


TÖÖ = APEA) ifg>h>t (2.20) 
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It helps to introduce the following step function: 


Ox)=1 ifx>0 
=0 ifx<0 


i ifx=0 (2.21) 
For two operators, the explicit definition of T ordering gives 

T[V (t) V(t) = O — )V(4)V(b) + Om — 4) VQ)V (4) (2.22) 
Of course, if V(t) and V(t,) commuted with each other, then the order of the operators is 


unimportant. The 7 ordering needs to be applied only to operators which do not commute at 
different times. Now rearrange the integral by using the above identity: 


al da | dette red 


1 l a a il t b> a a 
=z al dt | dt V (ti) V(t) + z | dt, | dt, V(t) V(t) (2.23) 


The second term on the right-hand side is equal to the first, which is easy to see by just 
redefining the integration variables t; —> t,t, — t,. These steps give 


1 t t A A t t, . ; 
zl, dt, | dh TW V) = | dt, | dt,V(t,)V (ty) (2.24) 


Similarly, one can show that 


] t t a a N 
al dn | an | deste EPE 


Y| 


t ty ty a a a 
0 0 0 
Returning to the expansion of U(t) gives 
a" t t A ` ` 
U(t) =1 + E= | an | da= | dt, TV DOV G) VN (2.26) 
0 
This expansion may be abbreviated by writing it as 
t A 
0 


However, it should always be kept in mind that the exponential form is really just a shorthand 
for the series definition (2.26). The 7ordered series definition is really equivalent to the 
original expansion (2.18). 
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2.2. S MATRIX 


It was shown in the previous section that the wave function in the interaction repre- 
sentation had a time dependence given by 


W(t) = UAY) (2.28) 
Now define the S matrix as the operator S(t, £) which changes the wave function We ) into 


YA: 


WO) = S, LW) (2.29) 
From the original definition 
Wi) = UW) = SC, NUCO) (2.30) 
which produces the result 
S(t, t) = UUT (C) (2.31) 


Now examine some properties of this operator. The first two of the following identities may be 
proved in a trivial way: 


1. S(t, t) = 1 = U(HUT (A = ete- (e e'ho") 
2. S(t, t) = UMUT = SC, 1) 
3. SE, OSE, t) = S(t, t”) 


The third identity can be shown by appealing to the original definition: 
VO = SC, E) = SE, DSE, PE’) = SEC) 


Finally, it is shown that S(t, t) can also be expressed as a time-ordered operator, 
ð / ð try T / 
ap kts t)= z U(t)U'(t) = —iV(OS(t, t) (2.32) 


which has the solution 


t 
S(t, t) = T exp| -i | dt, a) (2.33) 

t 
The function (0) = (0) was introduced in the discussion of the previous section. It is a 
wave function in the Heisenberg representation, so that it is independent of time. If a 
Schrödinger wave function is defined as 


W(t) = e™ 40) (2.34) 
then (0) is also the Schrödinger wave function at t = 0. Define 
WO = VOYO) (2.35) 


Then (0) is also the t=0 wave function in the interaction representation. At zero 
temperature the only wave function of special interest is the ground state wave function. For 
Green’s functions it is necessary to define (0) as the exact ground state wave function. Since 
the total Hamiltonian is H, the exact ground state must have the lowest eigenvalue of this 
Hamiltonian. An immediate problem is that the eigenvalues or eigenstates of the Hamiltonian 
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H are not known initially. In fact, that is exactly the kind of information we are trying to 
obtain by using Green’s functions. 

There is a problem in that the formalism is based on the wave function (0) which is not 
yet known. Now in the interaction representation set H = Hy + V, where Hp is chosen to be 
sufficiently simple that its eigenvalues and eigenstates are known. Let the lowest eigenvalue 
of H)—its ground state—be denoted po. Somehow the unknown wave function (0) must be 
determined in terms of the known wave function Qo. 

The relationship between the two ground states W(0) and , at zero temperature was 
established by Gell-Mann and Low (1951): 


(0) = SO, 00), (2.36) 
This result will not be proven: instead, it will be made plausible. One known result is that 
VO) = S(t, VO) (2.37) 
Operate by S(0, t) and get 
(0) = S00, IO (2.38) 
since S(0, f)S(t, 0) = 1 by the use of the previous theorems. Let t > —oo, and find 
(0) = S(O, —oo)(—co) (2.39) 


The important assertion is that y(—00) is equal to dp. The traditional argument is that one 
starts in the dim past (t — —oo) with a wave function þọ which does not contain the effects 
of the interaction V. The operator S(0, —co) brings this wave function adiabatically up to the 
present ¢ = 0. It is a wave function which does contain the effects of the interaction V, so that 
it is an eigenstate of H. 

There is an additional property of these states which is needed for the discussion of 
Green’s functions. As t > ++oo, then 


y(00) = S(oo, O)W(0) (2.40) 


One possible assumption is that (00) must be related to Qo. If they are equal except for a 
phase factor L: 


bye” = W(0o) = Soo, 0) (0) = Sco, —00)b, (2.41) 
e! = (o|S(00, —00)|o) (2.42) 


Alternatives to this assumption are discussed in Sec. 2.9. 


2.3. GREEN’S FUNCTIONS 


Most of this book is concerned with Green’s functions for only three types of particles: 
electron, phonon, and photon. The electron and phonon cases will be discussed here. The 
photon case is badly muddled by the several available choices of electromagnetic gauge. 
Section 2.10 is devoted to discussing these complications, and the photon Green’s functions 
will be treated there. But keep in mind that the basic proofs for the photon case are essentially 
identical to the phonon case. 
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At zero temperature the electron Green’s function is defined as 
GA, t — t) = ilI TOO OI) (2.43) 


The quantum number A can be anything depending on the problem of interest. Quite often it 
shall be taken to be the quantum numbers of the free-electron gas à = (p, o), where p is the 
wave vector and o is the spin. At zero temperature the state |) must be the ground state. If the 
Hamiltonian of the problem is chosen to be H, then |) is the ground state of H, and therefore 
it is an eigenstate of H. 

Of course, initially the ground state or any other eigenstate of H is not known since that 
is what is to be determined by using Green’s functions. Write H = H, + V, where Hp is the 
unperturbed part, while V is the interaction. Choose Ho so that its eigenstates are known. In 
Sec. 1.3 the C, are defined in terms of a complete set of states ,. Now select this complete 
set of states as the eigenstates of the unperturbed Hamiltonian Ho. In the definition of the 
Green’s function the C} represent states of Hy, while the ground state |) is an eigenstate of H. 
Furthermore, (2.43) is defined in the Heisenberg representation, so that |) is independent of 
time, while C} (t) is given by the usual result 


C,(t) = Ce (2.44) 


One way of understanding the Green’s function is to observe that it describes a certain 
Gedanken experiment. For t > ¢ 


GOA, t > t) = -illO (2.45) 


Here one takes the real ground state, and at a time ¢ one creates an excitation À. At a later 
time ¢ one destroys the same excitation. Now if i were an eigenstate of H, with HC}\) = 
&,C}|) and H|} = € |), then this state would propagate with a simple exponential time 
dependence: 


GO, t > t) = —i exp[—i(t — ¢)(&, — &)] (2.46) 


Because A is not usually an eigenstate of H, the particle in the state À gets scattered, shifted in 

energy, etc., during the time interval ¢t — £t. When one measures at a later time ¢, to see how 

much amplitude is left in the state À, the measurement provides information about the system. 
For the other time arrangement ¢’ > t 


Gia, t > t) = HiU AOA) (2.47) 


where the sign in front is changed whenever the position of two fermion operators is inter- 
changed. Now an electron is destroyed from the ground state at time ¢ and recreated at the 
later time t, which is possible only if there are electrons in the ground state at zero 
temperature. One case where this is true is in the Fermi sea of a metal. The initial destruction 
of an electron at time ¢ must, roughly speaking, remove an electron from the filled Fermi sea. 
This destruction creates a vacancy, often called a hole, and the hole can interact and scatter in 
the interval t — t. Then ci acting at ¢ destroys the hole state à, and this measurement 
provides information about the hole excitation. 

The next step is to take the Green’s function, defined in the Heisenberg representation, 
and convert it to the interaction representation. Call |)) = o the ground state of Ho, so that 


|) = S(O, —00)|)o (2.48) 


Sec. 2.3 e Green’s Functions 73 


Next change the operators to this representation: 
C,(t) = ete oC, (Nee = UT (NC, (NUD) 
= S(0, )C,()S(t, 0) 
Ga, t — t) = —i@(t — £')y(|S(—00, 0)S(0, 1)C,(HS(t, 0)S(O, t') 
x CT (OSCE, 0)S(O, —00)])o 
+ i@(¢ — t)p{|S(—00, 0)S(0, CË CSE, 0)S(0, t) 
x CDSM, 0)SO, —00)])o 


This expression may be regrouped by using the properties of the S matrix developed in the 
previous section. The left-hand bracket is replaced by 


0 (|S(oo, 0) 


i _ _ = 
9 {|S(—o0, 0) = e™ o {|S(0o, —00)S(—o0, 0) ~~ 9(|S(0o, —00)])g 


so the Green’s function becomes 
i 


GA t= — o (|S(co, —00)|)o 


[O(t — £) (SC, NÊ) 


x S(t, )CL(P)S(, —0)l)o 
— OF — Dlls, VISE, NCLDS(, —00)|) 0] 
The first term can be simplified by writing 
Ol — rls, NC, OS(t, NLS, —00)|)o 
= O(t — £)y(ITC,OCL(L)S(C0, —00)|)o (2.49) 


The operator S(co,—oco) contains operators which act in the three time intervals: 
(00, t), (t, “), and (t, —o0). The T operator automatically sorts these segments so they act in 
their proper sequences, which are, respectively, to the left of C, (0), between C, (£) and Chir ), 
and to the right of Chir ). The total Green’s function is expressed as 


AITOOGE )S(0o, —00) |) 


GQ, tt) = o (ITS(C0, —00)I)g 


(2.50) 
which is the desired result. It does not matter where S(co, —oo) is placed in the numerator, 
since the time ordering operator puts the pieces in the right place. 

A Green’s function can also be defined for the special case where the interactions V = 0 
and hence the S matrix is unity. This Green’s function plays a special role in the formalism, 
and it is designated by GO: 


GOA, t= t) = Ig (TOC 0 (2.51) 


G® is often called the unperturbed Green's function, or sometimes the name free propagator 
is used. 
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There are two quite different types of electronic systems in which we want to employ the 
Green’s function analysis. These two have quite different ground states, |}ọ, and also quite 
different real ground states, |). These two systems are the following. 

1. An empty band. Here the properties are studied of an electron in an energy band in 
which it is the only electron. An example is an electron in the conduction band of a semi- 
conductor or an insulator. In this case the ground state is the particle vacuum, which is 
denoted as |0). This state has the property that 


C,|0) = 0 (2.52) 
a,\0) = 0 (2.53) 


where the two destruction operators represent electrons and phonons. Therefore both Ho and 
V give zero when operating upon the vacuum. It follows that the S$ matrix also gives unity 
when operating upon the vacuum: 


S(t, —00)|0) = |0) (2.54) 


Both of the ground states, |), and |), are the vacuum |0). The Green’s function can exist only 
for the time ordering 


GO, t — t) = -i@(t — #)(0|G,(DCL()0) (2.55) 

The unperturbed Green’s function G® is particularly easy to evaluate: 
GO, t — t) = —iO(t — fe 0C C110) (2.56) 
= —iO(t — fe (2.57) 


The Fourier transform of G(A, t) is defined as 


G(A, E) = |" dte G(x, t) (2.58) 


—0O 


To make the integrals converge, add the infinitesimal quantity ið to the exponents: 


OO 
GOK, E) — -i| dteE—&. +8) 
0 


GO (A, E) = 7 l (2.59) 


— &) + 10 
2. A degenerate electron gas. Our second case is where the electrons are in a Fermi sea at 
zero temperature. The standard example is a simple metal. The system has a chemical 
potential u, and all electron states with E < u are occupied. If the unperturbed electrons 
(eigenstates of H,) are characterized by an energy £p the ground state |), has all states €% < u 
filled and states ¢, > p empty. It is convenient and conventional to measure the electron’s 
energy relative to the chemical potential, to define €, = €, — p. For a spherical Fermi surface 
with Fermi wave vector pr, 


ICC lo = Opp — K) (2.60) 


(IC Ch l)o = Ok — pr) (2.61) 
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A more general way to write this result is 


1 
im ep = nr(§) 


o(lCkCklo = O(—E,) = lim 
The unperturbed Green’s function is now 
GOD, t= 1) = -iO — 1) (IGOCLO No + OC — Do ICL 
= -[O(¢ — NEk) — OY — NO(-E, Je (2.62) 
The Fourier transform is 


0 
—00 


GO(k, E) = -if 0) | dte” P-t) _ ato | irene) 
0 


Q 0(— 
O G OS) 0.63) 
E-&+i6 E-&-—i6 
The energy Ëk = Ex — H is measured with respect to the Fermi surface p, = 0. Another way 
to write G® is 


1 
(0) — 
Ôk = 6 sgn(C) (2.65) 


where 6, is a small infinitesimal part which changes sign at the chemical potential ¢, = 0. 
3. Phonons. The Green’s function for phonons is defined as 


D(q, À, t — t’) = ~i(\TAg (DA_g (D1) (2.66) 
Ag, =A ta! gy (2.67) 


The subscripts À refer to the polarization of the phonons. Most applications have one kind of 
phonon in Hamiltonians which do not mix polarizations, and then these subscripts are omitted 
entirely. In the interaction representation 


_ olITAg(t)4_q(t’)S(00, —00)|)o 


Dat 1) = (IS, —00) Io 


(2.68) 


At zero temperature there are no phonons. The ground states |) and |}ọ are again the particle 
vacuum |0}. Note that in an electron-phonon system the notation |), means the combination 
of ground states for electrons, phonons, etc. Although the phonon system has the vacuum as 
its ground state, either of the two-electron ground states can be used. 

The unperturbed phonon Green’s function is defined as 


D(q, t — t) = ip |TAG(A_g(t)I)0 


= —ig(|T(age" + at gel"')(a_ge 1s" + aj ea")|) (2.69) 
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At zero temperature 
ollaqa} i)o = 1 


o(laiagl)o = 0 


D(q, t — t) = —if@(t — fea" + OC — Dela] (2.70) 
The Fourier transform has two terms which correspond to the two time orderings: 
D(g, ©) = | dte DD (q, t) (2.71) 
—& 
D(q, ©) = — to (2.72) 
O-@g tid W+@, ið 
20, 
= —___} __ (2.73) 


o? — og + 16 


Sometimes it is useful to have the phonon Green’s function at nonzero temperature. In this 
case, the thermal average is taken of the phonon occupation numbers, 


0(l4g4q)0 = Ng +1 


l 
T — — 
0(l4g4ql)o = Ng — Pq _ 1 
and the Green’s function of time is 
DO(q, t — 1) = —i[(Ny + eal! + Nye] (2.74) 


In some many-body systems the interactions cause changes in the phonon energies. For these 
cases, the thermal average at nonzero temperature should be over the frequencies which result 
from the interactions. The use of the unperturbed frequencies is valid only if they are not 
altered, which happens when the perturbation is either localized or confined to a small number 
of particles. 


2.4. WICK’S THEOREM 


The Green’s function is evaluated by expanding the S matrix S(oo, —oo) in (2.33) in a 
series such as (2.26): 


œo (Antl poo oO 
Gp t-)=F 2 | ar, | dt, (2.75) 


n=0 —0oo 


ITC (V(t) V (th) V(t, CEC 
* o(|S(oo, —00)|)o (2-70) 


Let us, for the moment, ignore the phase factor 9(|S(co, —oo)|)9. It will be taken care of in 
Sec. 2.6. The immediate aim is to learn how to evaluate time-ordered brackets such as 


oT APODO o (2.77) 
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Suppose that V(t,) is the electron—electron interaction Eq. (1.160): 


a me o 
Vit 1) = — 5 am Chg. Cy —q,s’ Crs Cy se” (Crta ték -qbk =w) 
2 kq ss 


In this case the time-ordered bracket (2.77) contains seven creation operators and seven 
destruction operators. It is a very arduous task to evaluate this bracket: there are many 
possible time orderings and many possible pairings between creation and destruction 
operators. What is meant by pairings between operators? First note that these brackets always 
contain the same number of creation and destruction operators. One is always trying to 
evaluate the product of n creation operators and n destruction operators between the ground 
state |}o 


ITC OCHA)» CE JICLOWo (2.78) 


The effect of a creation operator ci (r) is to put an electron into the state n’. The system must 
be back in the ground state before the final operator of »(|, so that one of the destruction 
operators Č (tm) must destroy the state n’ and m = n’ for some m. For example, 


(ITC, (OC) (t’)|)o (2.79) 
equals zero unless « = B, while 
o ITAN (ICL (2.80) 


equals zero unless « = B, y = 6, or unless « = ò, B = y. There are many possible time 
orderings and pairings in a bracket like (2.77). However, only a limited number of these 
combinations are physically interesting. Our aim is to sort these in a simple way to identify 
the important terms. This sorting is achieved with the help of some theorems which simplify 
the procedures. The first of these is Wicks theorem. 

This theorem is really just an observation that the time ordering can be taken care of in a 
simple way. Wick’s theorem states that, in making all the possible pairings between creation 
and destruction operators, each pairing should be time-ordered. The time ordering of each 
pair gives the proper time ordering to the entire result. 

For example, 


ITC ANCHE IC (ECO) )0 
o (ITAS ITEE 0 
- -_saresochOnwttretenetoons 
= 8,546 o (ITOE) o0 (ITCH) o 
— 8,358, o (ITAICE) 00 (ITCy)ICHE Mo 


Note that there is a time-ordering operator T in each of the two pairing brackets. For n = 3 
creation and destruction operators in the original bracket, there are six possible pairings; for n 
operators of each kind there are n! possible pairings. Also note that within a pairing bracket, 
the labels a, B, etc., must be the same. These labels denote eigenstates, so the creation and 
destruction operators must refer to the same state. 
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A few simple rules should be kept in mind when making these pairings. The first is that a 
sign change occurs each time the positions of two neighboring Fermi operators are inter- 
changed. One keeps count of the number of interchanges needed to achieve the desired 
pairing. An odd number of interchanges is the origin of the minus sign in the second term of 
the example above. 

The second rule concerns the time ordering of combinations of operators representing 
different excitations. For example, consider the following mixture of phonon and electron 
operators: 


(ITC (CE (Ag, (p, (2) Ch (5) 4g, (H)I)0 (2.81) 


Because electron operators commute with phonon operators, it is not important how they are 
ordered with respect to each other. The bracket can be immediately factored into separate 
parts for electrons and phonons: 


ITCo(ACE C), ICE (t5)1)00 ll TAa, (41) 4q, ()1)0 (2.82) 


This separation is always possible with different kinds of operators, 1.e., whenever operators 
commute. Wick’s theorem also applies to brackets of phonon operators; for example, 


(IL Ag (t Aq, (Ag, (13) Ag, (t4)|)o 
= o(ITAg (ti Aq, (t2)1) 00 (l TAg, (t3)4q, (t4)l)0 
+ o(ITAg, (t1)4q, (t3)1) 00 (TAa, (t2) Aq, (ta) 0 
+ o(ITAg, (t1)4q, (t4)!)00(I 7g, (2) 4q, (43) Io (2.83) 


In this case, for boson operators, there is no change of sign when the order of the operators is 
changed. Each of the above brackets vanishes unless the two wave vectors are equal and 
opposite: 


= Ôq +q %q3-+44 o(ITAg, (t))A_g (I) o0(ITAg, ()A_g, (t4)|)o 
+ 84, +4, qyq, 041g, (A-a, (65)1)00(1 74g, 24a, (44) I) 0 


+ 84,+4,5q,+4;0(I74q, (114g, Ga))00(I 7g, (2)A-q, )I)0 (2.84) 


The third rule is a method of treating the “time ordering” of two operators which occur at the 
same time, such as 


ITC, Ck, Io (2.85) 
In these cases the destruction operator always goes to the right: 


= Sk, =, 0(ICh, (11) Ci, (ti)l)o = OK, <1, (Sx, ) (2.86) 


and the term is just the number operator which is independent of time. This convention is 
dependent on the convention used to write down the Hamiltonian. In constructing H the 
destruction operators are put to the nght of the creation operators in all terms in the 
Hamiltonian. 
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When two electron operators have different time arguments in a pairing, it is conven- 
tional to put the creation operator to the nght: 


(ITC (HC, (h)l)o = Sk, 1,0 I Ce, (CL, (4) N00 (2.87) 


This term can be immediately identified as the unperturbed Green’s function iG (k,, t, — t,). 
All of the pairing brackets for electron operators are either Green’s functions or else number 
operators. The previous examples can also be written in terms of Green’s functions: 


(1 ((ITCOCUA)C ICE o 

= Supsa CO, t — 1) GY, b — t’) 
= öpp CV (a, t — GOY, ty — t) 

(D ollT Aa, Aa, (h)a, ()Aa, (ta) lo 

= A AA DA CTE ti — t)DO (gg, t — t4) 
+ Ôq, +q; N D C TE ti — t3)D(qy, ty — ty) 
+ §4,+4,54,4q,> (G1. ti — t4)D (qp, ty — b) 


In summary, Wick’s theorem states that a time-ordered bracket may be evaluated by 
expanding it into all possible pairings. Each of these pairings will be a time-ordered Green’s 
function or a number operator np or ng. This expansion gets the correct time-ordering for the 
entire brackets. Wick’s theorem is valid only when the Hamiltonian Hp is bilinear in creation 
and destruction operators. The theorem fails if Họ contains pair-wise interactions between 
particles. 

A comprehensive example is done for the n = 2 term in the S matrix expansion in 
(2.76). The n = 0 term is always GO: 


GPi- = 09t- | di TEAPA 


+ D dn | dis ATOPAN 


The interaction will be taken as the electron-phonon interaction: 


V = 2 MaAq ct Cys (2.88) 


+q,5 
First note that the n = 1 term must vanish because it contains the factor 


o({TAgl)o (2.89) 


which is zero since the factors (O|a,|0) and (0|a4|0) are zero. Similarly, all the terms where n 
is odd vanish because their time-ordered bracket for phonons contains an odd number of Ag 
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factors. Only the terms even in n contribute to the S-matrix expansion for the electron-phonon 


interaction: 
OO (0,0) 
| dt, | dt, 
—CX —OO 


x E My Mao (ITÀ, (t1)4q, (42)!)0 
4:92 


G(p, t — t) = Gp, t — t) + 


(-i)° 
2! 


x 2 ITC (ÐC aq, Cr, (t) 
Koss” ' 


x ĈE ra, tC Io 
The phonon bracket gives a single-phonon Green’s function: 
o(ITAg (tiA, (4)1)0 = ðq, +q, DO (Q1; f — t>) (2.90) 


The electron bracket, unfortunately, has six possible combinations of pairings. The six terms 
are evaluated using the fact that q, = —q,. Wick’s theorem gives the result 


o(lTp OË 44, stk, (th +4, s(t) Cs Ao 
= MIT poA, 44, .s)I) 004 TC, DGE a, slo 
x (ITC Ct Ao 
+ ITC OË 14, '(t)00IT Cx, stp oOo 
x (ITC elt )CK +4, s(t) lo 
+ IT Cpl), 44,5101 Ca, (CLO) 
x o (ITF tas (hs (bh)o 
+ ollTÊpo (DCE, ta, v) o0 UT Êr, s Eo 
x MITC 44,8 Cr st )Io 
+ o(lT Cp Ðo) o0 (Ë 4.4, 2 tC, s )o 
x o (TEË 4, (tC, (to) Io 
= o (lT Ailo ITC, s(t Ch 4q,.0°(t2)I)0 


Xo (Tks DÊ +4, D)o 
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Each bracket is either G functions with different time arguments, or else a number operator 
npr at equal times. The six terms, taken in the same order, give 


8 pk =k, +4, Sfp GPP, t — GOP — gy, h — )G(p, t — £) 
+ i* 8 pak, =k, —4, § <6 GO'(p, t — 1) P, +a), t — 1) GOP, 4 - r) 
+ 784-05 p=k, 6 ot (Ex, GO (p, t— t)G(p, t — 7) 
+ i q, =00p=k, õy-onr (Ék GO, t — )G(p, h — t’) 
+ Sq, -05q,-0%r(Ex, Nr (Ék, JG (p, t—t) 


— P84. = -q, v=, t — r)GO (ki, t — h)GO(k, + qy, h — 4) (2.91) 


2.5. FEYNMAN DIAGRAMS 


Feynman introduced the idea of representing the kind of terms in (2.91) by drawings. 
These drawings, called diagrams, are extremely useful for providing an insight into the 
physical process which these terms represent. These diagrams can be drawn both for the 
Green’s function depending on time G(p, t) as well as for functions which are Fourier- 
transformed and depend on energy G(p, E). 

The diagrams in time are drawn by representing the electron Green’s function 
G(p, t — r’) by a solid line which goes from ?’ to t, as shown in Fig. 2.1. An arrow is often 
included to represent the direction. The arrow is mostly for convenience, and it does not imply 
or require that t > ¢’. The phonon Green’s function is represented by a dashed line. It does not 
have a directional arrow because 


DO(q, t- r) = D(-q, t — À (2.92) 


and the sign of q is irrelevant. Phonons can be viewed as going in either direction in time. 
Next consider the diagram for the factor 


oIChs()Cps(Ol)o = ne(Ep) (2.93) 


The occupation number np is drawn as a solid line which loops and represents an electron line 
which starts and ends at the same point in time. By using these rules, the diagrams are 
constructed which represent each of the six terms in the S-matrix expansion (2.91). These six 


Oped) = Ro 
£ á 
(0) ' 4 
D (g,t-t) = ee 
7 z' Fs 
<CptiZicpé)> = Ov 
FA 
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FIGURE 2.1 
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terms are shown in Fig. 2.2, and the diagrams (a) to (f) are the six terms in (2.91) in the same 
order. Each term has the phonon line connecting the times f, and t. 

In Fig. 2.2 the terms (c), (d), and (e) are zero. They vanish because they exist only if the 
phonon wave vector q is zero, but there are no phonons with q = 0. A phonon with q = 0 is 
either a translation of the crystal or a permanent strain, and neither of these is meant to be in 
the Hamiltonian. The sum over q in (2.88) should exclude the q = 0 term. 

The two terms (a) and (b) in Fig. 2.2 are not zero. They are the contributions of primary 
interest. By using the results of (2.91), the contributions of the terms (a) and (b) are 


1 CO (0,0) 
zl dt; l dth, > |M, D®(q, ty — h) 
! v T 


—C 
x Gp, t — 4) GO(p — q, ti — GO P,  - r’) 
+ Gp, t-4)G P + q, t —4)G(p, t - A) (2.94) 


The two drawings (a) and (b) look alike. They differ only in the labeling of the variables 
ti, ty, q. But these are variables of integration in (2.94) and may be relabeled which shows the 
two terms are equal. Only one term need be written, and the factor of 1/2! in front is 
eliminated. 

The term (f) in Fig. 2.2 is 


GO (p, t — F, (2.95) 


—00 


i CO ©O 
F; = -5| dt | dt, © |M? D®(q, ti — ))GO(k, t — h) 
° ee) kq 
x GOK +q, t —t) (2.96) 


(a) (b) 
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FIGURE 2.2 
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FIGURE 2.3 


This drawing has the property that part of it is not topologically connected to the Green’s 
function line G(p,t — r). Diagrams in which all parts are not connected are called 
disconnected diagrams. For example, in Fig. 2.3, part (a) is disconnected, while part (b) is 
connected. The disconnected parts, as in (2.95), provide just a multiplicative constant such as 
F, which multiplies the contribution from the connected parts. 


2.6. VACUUM POLARIZATION GRAPHS 


Now consider the factor which has been ignored to this point: 


o(|S(co, —Oo|)y = D D | dt, | dt, ++: | dt, 


n=0 n! —00 —00 


x (ITV (t) V(t) -P E)o 


Again consider the electron-phonon interaction and evaluate the term n = 2. The n = | term 
vanishes as it did for the Green’s function expansion, 


A2 oo oO A A 
So =1 +E anf ae oli EPO (2.97) 


where 


UTED) = E M, My, 0 ITAg, (t)4q,(b)I0 
qı 42 


x D llT CK yq s(ts)Cr, s(t) 
k, kp ss’ 


x Cha as (t)ko 
By using Wick’s theorem, 
o (TAa (tq, (hlo = ibg ta POA h — h) 
IT Ch, +a, s) Cs Ce, —g, 2 C2) Cry,s1(2)I)o 


= ðq, "Fr (Ëk Mr (Ex,) + Ök =k, —a, GK, t —2)GPK +q — 4) (2.98) 


The Feynman diagrams for the two terms in Eq. (2.98) are shown in Figs. 2.4(a) and 2.4(b). 
The (a) term is zero because there are no q = 0 phonons. The (b) term is nonzero and gives a 
contribution 


o(IS(oo, —oo|)p = 1 +F +++: (2.99) 
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(a) (b) 
FIGURE 2.4 


where F} is defined in (2.96). The constant F} appears whenever the closed bubble of 
Fig. 2.4(b) occurs, regardless of whether the term arises in the disconnected diagrams of 
G(p, t — ¢’) or in the expansion of 9 (|S(oo, —oo])o. 

The terms in the series for 9(|S(00, —0o|), are called vacuum polarization terms. Some 
terms for n = 4, where there are two phonon lines, are shown in Fig. 2.5. Each of these 
diagrams or terms represents a constant F, which one can evaluate by doing the required time 
and wave vector integrals. The constant 9 (|S(0o, —0o|),g could be evaluated by computing all 
the F, and then summing them (Fp = 1): 


g(|S(co, —00|)9 = DF, (2.100) 


J= 


This procedure is unnecessary because of a cancellation theorem. 

The next theorem also simplifies the calculation of the Green’s function expansion 
(2.76). This theorem is that the vacuum polarization diagrams exactly cancel the disconnected 
diagrams in the expansion for G(p, t — £). The net result is that in calculating G(p, t — r') one 
needs only to evaluate the connected diagrams. The other contributions, from the discon- 
nected diagrams and from 9(|S(co, —œ]|}ọ, exactly cancel one another. 

This theorem will not be proven, but only explained. Call G.(p, t — ¢’) the summation of 
all connected diagrams, and the basic theorem is that 


lITCHHCE)S(C0, —00l)o = Ge(P, t — £) (ISo, -001)o 
The Green’s function (2.43) is just the summation of all the connected diagrams: 
G(p,t— t) = G,(p,t- t) (2.101) 


The proof of this theorem is just a counting problem. One must convince oneself that each 
connected diagram has, in higher-order terms in the S-matrix expansion, all disconnected 
parts which exactly add up to 9(|S(co, —0oo|)9. For example, the self-energy diagram in Fig. 
2.2(a) has in higher order the vacuum polarization terms shown in Fig. 2.6. The summation of 
all these terms, to all orders, is just the factor 9 (|S(0o, —0o|)9. The important point is that each 
disconnected part is just a constant factor F;. This theorem is very convenient, since it states 
that one can just ignore the disconnected diagrams. They do not need to be calculated. It is 


just as well, since when they are evaluated they often turn out to be infinity. In fact, it is easy 


FIGURE 2.5 Vacuum polarization graphs 
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to show that they are infinity. The disconnected term F, is defined in (2.96). The integrand is 
only a function f(t, — t,) of t, — t. Change integration variables to 


T=1,- 
s= F(t, +h) 


and Eq. (2.96) becomes 


OQO OO 
F, = | as| dtf(t) (2.102) 
—oo —oo 
The important point is that there is no dependence on s, so 
0oO 
| ds = œ (2.103) 
—Co 


One can show that each disconnected part has an “extra” time integral and has the same 
infinity. 

It has been shown that the one-particle Green’s functions consist of just connected 
diagrams: 


, 2 (-i)” OO (0,6) 
G(p,t-t)= i} dt,--- dt, 


n=0 
x o(lT pott (4) ++ V(t,)I)9 (connected) 


The next step is to get rid of the 1/n! factor. It is eliminated because there are just n! terms 
exactly alike in each bracket of the nth term in the expansion. 
Considering only different terms gives the result 


G(p,t—1) =-i So | di | dto (IT OI) 


—0Oo —00 
x V(t,)---V(t,)l)o (different connected) (2.104) 


The obvious question is then: how can you tell when terms are different? Usually one can tell 
by inspection, although sometimes terms must be examined carefully. In Fig. 2.2, terms (a) 
and (b) are the same and provide the 2 = 2! necessary for this n = 2 term. Similarly, (c) and 
(e) are identical. For example, (a) and (b) differ only in the variables 1,, t and q,. But these 
are dummy variables of integration and so may be relabeled, respectively, to t,, t; and —q,. 
Then the (a) term is obviously the same as (b). 
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The next terms in the electron—phonon expansion have n = 4, which are diagrams with 
two phonons. Here each different connected diagram is found 4! = 24 times. 


2.7. DYSON’S EQUATION 


The Green’s function of energy is defined by taking the usual Fourier transform with 
respect to the time variable: 


G(p, E) = l dte -DG t — 1) (2.105) 
—CO 


This time integral has already been evaluated for the unperturbed Green’s function with the 
following results. For a single particle in a band, the result from (2.59) is 


1 
GO (p, E) = —————- 
P= Fee 
For a fermion in a degenerate electron gas, the result (2.64) is 
Go (p, E) = ————_—— 
E—&,+i0, 
The Fourier transform in time is applied to each term in the S-matrix summation: 


; ; (0 @) (0 @) 
dte?) | dt- | dt, 


oe) 
— —00 —00 


GQ, E) = —i 3 c | 


x o(|Tpo ACi CP) --- V(,)I)o (different connected) (2.106) 


To see what sort of terms develop, consider the example of the electron-phonon interaction. 
The first two terms [Figs. 2.2(c) and 2.2(e) are zero] are G® plus the self-energy term in Fig. 
2.2(a): 


OO (0,@) 
dte- | dn | dt, 


—0Oo —&O —0O 


G(p, E) = GQ, E) + C? MI? | 
q 


x GOP, t— 1) GO — 4, t — b) 
x GOP, t — NDO (q, ti — b) (2.107) 


The phonon Green’s function of energy is defined the same way: 


OO 
D(q, ©) = l dte"®D(q, t) 
—0o 
°° dw —it@ 
D(q, t) = zne D(q, ®) (2.108) 


Using the unperturbed phonon Green’s function in (2.107), 


CO 


D(q, t —h) = | 


—OO 


S DDO, o) (2.109) 


Sec. 2.7 e Dyson’s Equation 87 


and the remaining time integrals are easy: 


oO (0 @) 
| dtelt WE G(p, t— w| dt, ellfi 7h)E-0) GO (p -q.t — b) 
—OO —OoO 


OO 
x | dte EGO, h — r) = Gp, EGO — q, E — o) 
—CO 


The first two terms are 


G(p, E) = G(p, E) + Gp, EEO P, E) + + (2.110) 
where the self-energy of the electron due to one-phonon processes is 
Oo 
d 
Op D=) Prim POG, 0G%P-GF-0) eD 
-œX q 


This self-energy will be evaluated in Chapter 7. 

The electron-phonon interaction has four connected diagrams in n = 4; they are self- 
energy diagrams with two phonons and are shown in Fig. 2.7. These four terms give, 
respectively, the contribution to the Green’s function series (2.104): 


i dt p dt | dt; | dt,G(p, t — (x \MyMy DQ, th — t) 

-oo —~oo 00 00 qq’ 

x DOG), b — uG” P + q; t — hG P, b — )G(p + d'h — t) 

+ GOP +q, ti —6)GOP +q +d, t- h)GOP +d, t- 4G, 4- r) 
+ GO(p +q, t- 6)GOp +q +d, 6 —t)G(p+q, tt, —F)G(, h — 1) 


+) \M,|*D (q, ty — t)D(g, t — Gp + q, ti — 4) G(p, 4 — t’) 
q 


x + GOK, t — t)G®(k + q, h — a) 
k,o 


Figure 2.7 shows the labeling of electron and phonon Green’s functions and the time label of 
each vertex. The Fourier transform of these terms is taken to give their contribution to the 
Green’s function of energy, 


GO(p, EYE (p, EY + G(p, EEO (p, E) + EQ, E) + E°(p, E) 
so that combining this result with the earlier one in (2.110) gives 


G(p, E) = G(—p, E1 + GOpEY + ECP + ECD 4 ECA 4 [GOXDOP 2 (2.112) 


é 
x g Í ED 
ITN BE 2 Who 9? 
>>> 1 aa DE V A ps p> oE E E A EN 
Z bsk: kh á ds EA P é py ta F 


(b) (c) 


FIGURE 2.7 Two-phonon Feynman diagrams in the self-energy of the electron. 
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The first term, which comes from Fig. 2.7(a), contains three unperturbed Green’s functions 
G© and two one-phonon self-energy terms ="). Each of these factors can be associated with 
a piece of Fig. 2.7(a), as illustrated in Fig. 2.8. Similarly, the last three terms in Fig. 2.7 
contain the three diagrams which represent the two-phonon self-energy terms: 


EC (py, E) + ECP, E) 
© dof” da’ , 
= | =| Fy MMg D”, DA, 0) 
—00 —=00 qq’ 
x Optq£+o0Gp+qtq,£+o+) 


x [GOP +q, £+o’)+ G(p+q,£+0)] (2.113) 


Co (pH, E)= l 


— 0o 


do 
zy > Ma D” (q, 0) G(p + 4, E +0) 
q 


i do’ 0 / 0 / 
x — Y GO (k, 0')GO(K + q, © + 0’) 
2n ko 


—&OoO 
Dyson’ equation is obtained by formally summing the series in (2.112), 


GOM, E 


— GO (p, E)Z(p, E) en) 


where the total self-energy X(p, E) is the summation of all different self-energy contributions: 


X(p, E) = $ Z9, E) (2.115) 
j 


So far the example of the electron-phonon interaction has yielded the following four terms 
for the self-energy: 


X(p, E) = £) (p, E) + z2) (p, E) + g2) (p, E) + zC) (p, E)+- 


Contributions such as Fig. 2.7(a) which contain (x)? do not mean that È contains (ZVY. 
The Green’s function expansion contains terms in (=‘)* because the expansion for G is 


GO) 


— — G9) (0)2 (0)3 5:2 (0)453 


Successive terms contain higher powers of each © plus cross terms. Dyson’s equation is 
really a theorem which states that one may sum the series of self-energy terms which develops 
in higher order, and the form (2.114) is obtained with each distinct contribution to Ł 
occurring just once. 


g'a 5" go 5" gio 
SS OO Tl OO 
aT na 


FIGURE 2.8 These four terms give, respectively, the contribution to the Green’s function series (2.104): 
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The derivation of Dyson’s equation has been rather complicated and consisted of a large 
number of steps and theorems. Each step has to be understood before the final result is 
understood. Yet the final result—Dyson’s equation—has achieved a great simplification. It 
states that the exact Green’s function is obtained from (2.114) by just calculating the self- 
energy 2(p, £). The self-energy is a summation of an infinite number of distinct diagrams. 
This method is useful only if X(p, E) can be approximated by the lowest few terms in the 
series. Alternately, sometimes one can sum subsets of diagrams in the series. However, except 
in a few rare cases, it is impossible to get X(p, E£) exactly, and one must be content with an 
approximate result. If the approximate result is not a very good approximation, one should not 
try to solve the problem in this fashion. Some of the alternate methods will be developed in 
subsequent chapters. But basically one should realize that Dyson’s equation is usually useful 
only in weak coupling theory where the perturbation is sufficiently weak that an adequate 
approximation is obtained with a few terms in X(p, E). Later chapters discuss some strong 
coupling theories which use Dyson’s equation. 

A great simplification has been achieved. It is only necessary to evaluate a few self- 
energy diagrams in X(p, E). If they are sufficient, the calculation is finished. If they are not, 
then discard the results and try something else. The formidable-looking series (2.76), which 
served as the starting point, has been reduced to the evaluation of a few terms. 

Dyson’s equation is often written in a slightly different but equivalent form. It is obtained 
by using the algebraic form for G®). For one electron in a band, 


G(p, E) = ———— 
(P. £) E — £, + ið 
G(p, E) = l (2.116) 
' E — £, + iò — È} (p, E) l 
When the electron is in a Fermi sea at zero temperature, 
l 
GO (p, E) = ———_—— . 
(p, E) E-e +i, (2.117) 
1 
G(p, E) = =n (2.118) 
E — £p + id, — X(p, E) 


One aspect of this result deserves special mention. The infinitesimal part 5, switches sign 
depending on whether €p > or €p <u, where p is the chemical potential. Taking the 
imaginary part of (2.117) gives E = ¢,. Therefore ImG(p, E) changes sign depending on 
whether E > p or FE < uų. 

The self-energy has real (Xp) and imaginary (2,) parts which are written as 


X(p, E) = Xp(p, E) + i, (p, E) (2.119) 
Later it is shown that 2, also switches sign at E = p in the same manner: 
u(p,£)<0, E>u 
X(p,£)>0, E<u (2.120) 


The switching of signs of 5, was caused by the distinction between electron excitations with 
Sp > 0 and hole excitations with Ep < 0. This distinction is maintained even when X(p, E) is 
included, i.e., in the presence of interactions. The one-electron Green’s function (2.116) has 
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the feature that 2, < 0, so the imaginary part of the denominator always has the same sign. 
The electron self-energy is sometimes called a mass operator. 
The phonon Green’s function has the same type of Dyson equation: 


D(q, © 
D(q, ®) =- (4. o 


— D(q, o)x(q, ©) ee) 


where the self-energy function for phonons is n(q, œ). This equation may also be written in an 
alternate form by utilizing the following result for D©: 


D(q, ©) = a (2.122) 
o? — og + ið 


20 
Da o= “a 2.123 
(q, ©) Q2 — o2 + ið — 207(q, ©) 


The phonon self-energy term n(q, @) is sometimes called a polarization operator. This name 
is quite descriptive, since the self-energy effects arise from the phonons causing polarization 
in the medium. 

The real and imaginary parts of the self-energies X and n each have interpretations. The 
imaginary part, X; or Ty, is interpreted as causing the damping of the particle motion. They 
are related to the mean free path of the excitation or its energy and momentum uncertainty. 
The real parts are actual energy shifts of the excitation, which may also change its dynamical 
motion. The excitation may alter its effective mass or group velocity because of the self- 
energy contributions. 


2.8. RULES FOR CONSTRUCTING DIAGRAMS 


It is certainly worthwhile for the student to evaluate some self-energy diagrams by the 
method outlined above. One should expand the S matrix, decide which terms are connected 
and which are zero, and finally obtain the self-energies by a Fourier transform. But this 
laborious procedure can be avoided because the self-energy diagrams can be written down 
directly by following a few simple rules. That is, self-energy expressions such as (2.111) and 
(2.113) are easily written down in the form shown in these equations. The evaluation of the 
wave vector integrals in these expressions remains a formidable task. These rules are as 
follows: 


1. Draw the Feynman diagram for the self-energy term, with all phonon, Coulomb, and 
electron lines. 
2. For each electron line, introduce the following Green’s function: 


Gp, zy) = 2B _ (2.124) 
“BM E — £p + id, 

The òg conserves the spin index and indicates that the electron line must have the 

same spin at both ends of the propagator line. This feature is important in spin 


problems. The factor 6, is ô for one electron in a band and 6 sgn (&p) for degenerate 
Fermi systems, 1.e., those with a Fermi sea at zero temperature. 
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3. For each phonon line, introduce the following phonon propagator: 


D(q, ©) = a (2.125) 

co? — we + ið l 
Also add a factor of \M, I? for each phonon Green’s function, where M, is the matrix 
element for the electron-phonon interaction. 

4. Add a Coulomb potential v, = 4ne* /q? for each Coulomb interaction. Note that the 
Coulomb line is always drawn as a wiggly vertical line. The Coulomb interaction is 
regarded as happening instantaneously in time, and time flows horizontally, from left 
to right, in our diagrams. One could, of course, have time flow upwards and draw the 
Coulomb interactions as horizontal wiggly lines. 

5. Conserve energy and momentum at each vertex. Each electron line, phonon line, and 
Coulomb line have their variables labeled to conform with this rule. 

6. Sum over internal degrees of freedom: momentum, energy, and spin. If one is 
calculating a self-energy term X(p, FE) of the electron, then all momentum and 
energies except p and E are internal and must be summed over. 

7. Finally, multiply the result by the factor 


j™ 


om (DERS +1)" (2.126) 


where F is the number of closed fermion loops. The index m is chosen as follows: 


(a) For electron self-energies, m is the number of internal phonon and Coulomb 
lines. For example, m = 1 for (2.111), and m = 2 for (2.113). 

(b) For phonon self-energies, m is one-half the number of vertices. The spin of the 
particle is S, and the factor (2S + 1) is from the summation over spin quantum 
number m,. Electrons have 2S + 1 = 2. The factor (2)*” assumes taking the 
limit v — oo, so that the wave vector summations are integrals. For box 
normalization in a finite volume v, the factor is 


j™ 


(2nv)” 


(D (28 + DE (2.127) 
and then wave vector summations are discrete summations. 


The photon Green’s functions are discussed in the last section. But it seems tidy to 
present the rules for constructing diagrams with photons at this point, so that all the rules are 
together. One draws photon lines between lines which represent charged particles. The photon 
lines are usually dotted and usually are represented just like phonon lines. Charged particles 
interact with the photons through two terms in the interaction Hamiltonian. The first is the 
j- A term. For free particles, this has the form 


= Sie) AW) = Z jaAa) 
i qu 


e 
= ZAD DK +5 @)ChiqoCko (2.128) 
C qu ko 


This interaction brings us to the next rule for constructing diagrams. 
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8. For each photon line which interacts with particles through the j - A interaction, insert 
a factor 


2 
e 
uy 


where D,,,(q, œ) is the photon Green’s function and k and k’ are the wave vectors of 
particles scattered at the two vertices. The other possible interaction of a charged 
particle with photons occurs through the term 


e? 


2mc? 


e? 
DAT = a Z PDA, Ala- K) 


In Sec. 4.5 it is shown that this interaction contributes a self-energy term of e?ng/m to 
the self-energy of the photon, where nọ is the density of charged particles. 
Other texts often define the phonon Green’s function differently than the form which is 
selected here. However, all of these cases have the final product of the vertex and Green’s 
function as 


_2041Mgh (2.130) 
o? — o} + ið l 
For example, one choice has a Green’s function of the form 
(0) Og 
D (q, ©) = —— 2.131 
(4, œ) o — o2 + iô (2.131) 


Then, in the rules for constructing diagrams, one multiplies by the factor 2|M, | /©q for each 
vertex pair. Of course, these steps give the same result. 

Now examine some examples. Electron-phonon examples are given previously in 
(2.111) and (2.113). Some phonon self-energies are shown in Fig. 2.9(a). The first example is 
from the electron-phonon interaction, and the self-energy contribution is a closed fermion 
loop, so that F = 1 in rule 7. This contribution is 


mq, ©) = |M, P®(q, ©) 
—2i 
2n)* 


P(g, œ) = / | az | a pG(p, E)G(p +q, E +0) (2.132) 


FIGURE 2.9 
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The other phonon self-energy contribution arises from the lattice anharmonicity, which leads 
to interaction terms in the ion Hamiltonian proportional to the third power of the phonon 
displacement: 


V=} VQ? = X Ma qAqâqA-qy 
i qq’ 


Here the self-energy term is 
i 


(2m) 


(gq, œ) = fao |da PDO, o')D(q+q',0+0’) (2.133) 
Next consider terms in the electron self-energy arising from electron—electron interactions. 
The self-energy term in Fig. 2.9(b) is called the unscreened exchange energy and is a very 
important contribution to the electron’s energy. Its self-energy is 


È (p, E) = z, 7 | do |200% +q,E +0) (2.134) 
T 
This result can be simplified immediately using an important identity: 
[do 
IE- GO(p +q, E + ©) = —ng(Epiq) (2.135) 
The identity is proved by replacing the Green’s function by its Fourier definition, 
dw (~° , 
S] dt” +9 GO (p + q, À (2.136) 
2T Jo 


and then inverting the order of integrations. The frequency integral gives a delta function in 
time, 


dw it(E+o) 
— e = &(t 2.137 
[Ee () (2.137) 
and 
d 
i|2o%G +q, E+) =iG(p+q,t=0) 


The right-hand side is ambiguous. Since the Green’s function is a time-ordered product, 
different results are obtained depending upon whether t = 0 is approached from plus time or 
negative time, 


GOP + q, t > 0*) = —i(CyypqCprq) = —ill — nr (Epa) 


GOP +q, t > 07) = —i(Ch4qCp+q) = inr(Sp+q) 


In the identity (2.135), the t = 07 result is chosen. It was stated in Sec. 2.5 that equal time 
operators were to be taken in this order. These steps prove the identity (2.135). The self- 
energy (2.134) may be written in the simple form 


1 d° 
E(B) =~ E nrp) = — | Foye pra) (2.138) 


dq 
(2n)° 
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where the subscript x denotes exchange. The self-energy is no longer a function of the energy 

E of the particle, since all E dependence has vanished from the right-hand side. This exchange 

energy is easily evaluated at zero temperature (see Problem 6 at the end of the chapter). 
The next electron self-energy from electron—electron interactions is shown in Fig. 2.9(c): 


—2i 
(2n)* 


where the subscript “H” denotes “Hartree”. The self-energy depends on neither p nor E. The 
identity (2.135) may be used again to produce the result 


Lay = 2v4=0 > nr(Sp) = Ug—0Ne (2.140) 
p’ 


È (p, E) = | dE’ | ep. E’) (2.139) 


where N, is the number of electrons. Of course, when v, = 4ne”/q’, then the limit q > 0 
gives v=o equal to infinity. This term is the unscreened Coulomb energy from one electron 
interacting with all the other electrons in our system. This potential energy is truly a large 
number, which becomes infinity in the limit of an infinite system. But there must be an equal 
amount of positive charge in the system, and the electron interaction with the positive charge 
yields another large number which cancels the present divergence. The Hartree energy is 
defined as the net interaction energy of an electron from both of the negative and positive 
charge sources. It is zero in the jellium model of a metal, but is nonzero for actual systems 
composed of ions and conduction electrons. 
The third electron self-energy diagram is shown in Fig. 2.9(d). Its evaluation yields 


i 
(27) 


The factor P“)(q, œ) is given in (2.132). It corresponds to the closed fermion loop. This 
polarization diagram occurs frequently, is very important, and will be evaluated in Chapter 5. 

Finally, this section is ended with a short lecture on the way to draw diagrams. The types 
of Feynman diagrams are very intimately connected with the types of terms in the Hamil- 
tonian. Figures 2.10(a) and 2.10(b) show the two types of electron-phonon vertices and the 
terms they correspond with in the Hamiltonian. In each case, an electron is destroyed in state 
k (incoming arrow) and created in k + q (outgoing arrow). All of the electron-phonon 
diagrams involve only diagrams which can be constructed from these basic building blocks. 
Diagrams are not drawn in which one phonon makes two electrons, as shown in Fig. 2.9(c), 
since such terms do not occur in the Hamiltonian. Later such terms are encountered for other 
problems. Also, terms are not included in which two phonons are emitted while scattering one 
electron. These are anharmonic terms in the electron-phonon interaction. However, it has 
recently been suggested that they are important in some materials, so that in fact people do 
draw diagrams using them. The important point is that one must start with a Hamiltonian, 
decide which kind of vertex processes are permitted by each interaction term, and draw 
Feynman diagrams using only these basic building blocks. 


X(p, E) = | do | d’queP(q, o)G(p +q, E +0) (2.141) 


2.9. TIME-LOOP S MATRIX 


The S matrix is defined in Sec. 2.2. The time in (2.41) is taken over the interval 
(—oo, 00). The state at t = —oo is well defined as the ground state of the noninteracting 
system. The interactions are turned on slowly. At ¢ ~ 0 the fully interacting ground state is 
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FIGURE 2.10 


(0) = S(0, —0o)by. In condensed matter physics the state at t —> oo must be defined 
carefully. If the interactions remain on, then this state is not well described by the non- 
interacting ground state Ọọ. Alternately, one could require that the interactions turn off at 
large times, which returns the system to the ground state pọ. 

Schwinger (1961) suggested another method of handling the asymptotic limit t — oo. 
He proposed that the time integral in the S matrix has two pieces: one goes from (—o, T) 
while the second goes from (t, —oo). Eventually t — oo. The integration path is a time loop, 
which starts and ends at t = —oo. The advantage of this method is that one starts and ends the 
S-matrix expansion with a known state W(—oo) = pọ. Usually it is the only ground state one 
knows exactly. 

For equilibrium phenomena the time loop method of evaluating the S matrix gives 
results that are identical to the other methods such as those described earlier in this chapter. A 
small advantage of the time-loop is that the formalism has a sounder philosophical basis since 
the state W(oo) is avoided. However, the main advantage of the time-loop method is in 
describing nonequilibrium phenomena using Green’s functions. Nonequilibrium theory is 
entirely based upon this formalism, or equivalent methods. The equation of motion for the 
Green’s function can be cast into the form of a quantum Boltzmann equation for transport 
theory. This application will be developed in Chapter 8. 

A disadvantage of the time-loop method is that it employs six different Green’s func- 
tions. They are discussed next, as a preliminary to the expansion of the S matrix. Two of these 
Green’s functions, called retarded and advanced, are also needed for the later discussion of 
equilibrium theory at nonzero temperature. 
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The need for six Green’s functions is explained by considering the time-loop expression 
for the S matrix: 


S(—oo, —œ) = T, exp| =i | dP) (2.142) 
l 


oop 


The integration path is the time loop shown in Fig. 2.11. The variable s} goes (—oo, Tt) and 
then (t; — 00). The operator T, orders along the entire loop, with earliest values of s, 
occurring first. In expanding the S matrix, Green’s functions are encountered of the form 


GOA, s1 — 32) = ig (|T,Cy(51)CL (82) )o (2.143) 


If both sı and s, are on the top, or outward leg, of the loop then s-ordering is identical to time- 
ordering. However, if both s} and s, are on the lower, or backward leg, of the loop then s- 
ordering is the opposite of time-ordering. This case is called anti-time-ordering. Another case 
is when sı and s, are on different legs. Then they are automatically ordered, and indepen- 
dently of the values of the s arguments. 

Besides the time-ordered Green’s function, it is convenient to also define one that is anti- 
time-ordered, and others that have no time-ordering. These different cases give rise to the four 
Green’s functions. The other two are called retarded and advanced. They are linear combi- 
nations of these four. 


2.9.1. Six Green’s Functions 


For the time-loop expansion, it is necessary to define six different Green’s functions. It is 
possible to employ fewer than six since they are not independent, but using six simplifies the 
notation. They are all correlation functions which relate the field operator W(x, ) of the particle 
at one point x, — (1r,, ¢,) in spacetime to the conjugate field operator Wi" (x5) at another point 
X, = (Fy, t)). The six functions are the advanced G,,,, retarded Ge time-ordered G, anti- 
time-ordered G; and G<, G~ which have no name: 


G (x1, x2) = iV, )W"(X2)) 
G(x, X2) = i yE) 
G(x, x2) = Ot — 6L) (x1, x2) + Olh — t )G< (x1, x2) 
G(x, x2) = Oh — t1) (x1, X22) + Ot — b)G* (x1, X2) 
Gret(X1,X2) = G, — G* = C7 — G; 
Gady(X1.%2) = G, — G7 = G* — G; (2.144) 


The brackets |) and (| have the same meaning of Sec. 2.3 as the ground state of the interacting 
system. The time-ordered Green’s function is the same one in (2.43). Here it has been written 
in the field operator representation. For homogeneous systems in equilibrium, the Green’s 


=r 


FIGURE 2.11 The time-loop integration path in the S matrix. Eventually t > oo. 
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functions depend only upon the difference of their arguments (x,, x2) = (x, — x»). Then the 
most simple, and useful, quantities are the Fourier transforms of these quantities: 


G(k, E) = | res | de Ge t) (2.145) 


where the symbol G represents any of the six functions. Explicit expressions for these 
quantities are given below. 

Often the Hamiltonian H can be solved exactly in terms of eigenfunctions o,(r,) and 
eigenvalues €,. Two examples are the electron in a magnetic field or a free particle. Then it is 
useful to have the expressions for the Green’s functions in terms of these eigenfunctions. They 
are derived by expanding the field operators in terms of the eigenfunctions and creation il 
and destruction C, operators: 


px) = 2 Cy, (re ™ 


pœ) = 3 CÌ oF (re™ (2.146) 


The Green’s functions in (2.144) are evaluated with the occupation factor n, = (cI C,) and 
t = ti — h. At zero temperature n, = ©(—E€,) is a step function that is zero or one depending 
upon whether €, = £, — p is positive or negative: 


G (1%) = “FL — marine 
GA (12) = EL mor (roi Ee 
Gilt. %2) = =i [OO — mloxri)ozrnde 
Gil, x2) = =i DIOC) — mI dar oie 
Grai, x2) = —1O() 2 pepi ™ 


Gaay(%1,X2) = iO(—t) 2 ppi) (2.147) 


The above formulas are also valid in equilibrium at nonzero temperatures if n, = 
1/[exp(B6,) + 1] is the thermodynamic average (B = 1/kgT) of the occupation number. 
The starting point for any calculation, at least conceptually, is the behavior of the Green’s 
functions for systems without interactions. Then the wave functions are those for plane wave 
or noninteracting Bloch states, if such can be defined. The quantum number À becomes the 
wave vector k, and a spin index o, which is usually not written. The eigenvalue combination 
is p, (r1)p} (r2) = explik » (r; — r2)]/v. The superscript “(0)” or the subscript “0” on the 
Green’s functions means those for a noninteracting system in equilibrium. Fourier-trans- 
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forming the r variable to k as in (2.145) gives the free-particle Green’s functions G(k, 2). 
For fermions of band energy €, and occupation number ng = n;(€, — H) they are 


Go (k, t) = —i[1 — nyje 
Go (k, t) = inpe **" 
Gk, ) = HOA — n, Jew 
Gk, t) = -O(t — nle 
Geet (Kk, t) = —iO(t)e*«! 


Gag (Kk, t) = i@(—t)e~**! (2.148) 


The ¢ variable can be Fourier transformed, which gives the noninteracting Green’s function of 
energy E; the quantity ò is infinitesimal 


G? (k, E) = —2ni[1 — mJS(E — e) 
Go (k, E) = nin, OE — Ex) 


Geet (k, E) = eT 
Gagy(k, E) = Fac 
GO(k, E) = G,. + GF = EEA 
GO (k, E) = GF — Gray? = = (2.149) 


The time-ordered function Go is exactly the same one in (2.64). Note the two kinds of 
infinitesimal deltas: 6 is always positive, while 6, is positive for k > kp and negative for 
k < kp as defined earlier in (2.64). The retarded functions always have a positive ô, even for 
electrons in a partially filled band. The noninteracting advanced Green’s function resembles 
that of an empty band in (2.59). These two Green’s functions could differ as soon as inter- 
actions are introduced, since they have different self-energy functions in degenerate Fermi 
systems. Also note that expressions such as Ge = G, — G< are obeyed for interacting and 
noninteracting functions They are obeyed for both cases of arguments (k, £) and (k, E). 
The above Green’s functions are suitable for particles such as electrons, or holes in 
semiconductors. Another type of Green’s function is needed for boson fields such as phonons 
or photons. For phonons let Q(x) be the displacement from equilibrium of the ions in the solid 
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at position x = (r, t) in spacetime. Displacements are vectors, but treat them as scalars for the 
moment. The phonon Green’s functions are defined as follows: 


D? (x1, x2) = —i(Q%)O@2)) 
D= (x1, x2) = —i(Q@2)Q@1)) 
DAX1, x2) = O(t — L)D? (x1, x2) + O(t — t) DI (x1, x2) 
Di, x2) = O(t — t )D7 (x1, x2) + OF, — L )D (x1, X2) 
Det = D, — D~ = O(t — b )[D7 — DX] 
Da = D, — D™ = -O(t — t) LD7 — DX] (2.150) 


These expressions are rather similar to those in (2.144) for particles. The main difference is 
that D< and D~ have the same sign, since no sign change is made when interchanging the 
positions of boson operators. Also the displacement operator is Hermitian [Q = Q], which 
introduces some redundancy such as D<(x,, x2) = D? (x5, x;). 

The displacement operators Q are usually represented in terms of phonon raising (at) 
and lowering (a) operators. The usual case is to use A, = ala +a, instead of Q(x) in the 
definition of the phonon Green’s function. In this representation, the phonon Green’s func- 
tions in equilibrium are expressed in terms of the phonon occupation number N, = (aha,), 
which equals 1/[exp(Bo,) — 1] in thermal equilibrium at nonzero temperature, and equals 
zero at T = 0. The noninteracting results are: 


D? (q, t) = —i[(Nq + De’ + Nye] 
D<(q, t) = —i[(Ny + 1)e*’ + Nye] 
Dala, t) = —20 (4) sin(a) 
D,av(q, t) = —20(—t) sin(@q) 
D,q, t) = —i{(Nq + O(—D)e’ + [N, + OCDE} 
Diq, t) = —i{(Nq + OA) + [Ng + O(-Nle*’} (2.151) 


These Green’s functions are used in the expansion of the S matrix. 


2.9.2. Dyson’s Equation 


Each of the six Green’s functions can be evaluated for an interacting system. They can be 
expressed in the interaction representation in terms of the time-loop S matrix. For example, 
one of them is 


G= (x1, x2) = SW") W@,)) (2.152) 


where the S matrix is given in (2.142). This argument has no time-ordering operator since the 
order of the two state operators is fixed. However, the S matrix is time-ordered. The time ¢ is 
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on the upper loop, while t, is on the return loop. The Green’s function in the above equation 
has an S-matrix expansion of the form 
< . (—i)" A ^ À S 
G) =E- | dy [asrin -PD ENE) (2.153) 
where all s integrals are over the time loop. A method is needed to evaluate expressions of this 
type. 
The potential V is composed of electron, phonon, or photon operators. The operators are 
paired using Wick’s theorem. Each pair will have a time argument such as G(s,, s;). If both s; 


i» 
and s, are in the top loop, the expression is just the time-ordered Green’s function. If they are 
both in the return loop, the expression is the anti-time-ordered Green’s function. If one s 
variable is in the top loop and the other is in the bottom loop, then the 7, operator makes this 
expression be either G< or else G~. These relationships are shown in Fig. 2.12. The mth term 
in the Green’s function expansion is a product of (n + 1) factors, where each factor is one of 
the four Green’s functions in Fig. 2.12. 

It helps to have a simple example. Below is given a potential term V of the type found 
for electrons scattering from impurities. The first term in the S-matrix expansion for GS with 
this interaction is 


V = > M,gCiCg 
af 
G*(A, ti — h) = G À, t — h) 


+r Map | dso(I7Ch(e)Cp(Iao(IZCHIC (tI (2.154) 


The s integral runs over the time loop. For G< remember that ¢, is on the top loop while h is 
on the return loop. There are two possibilities. If s is on the top loop, the first Green’s function 
in the s integral is G< and the second is time-ordered. If s is on the return loop, the first is 
anti-time-ordered while the second is G<. The two terms are (s = t, t > 00) 


G<(h, ti — b) = GAA, t — h) + Ma | dt [GO (a, ti — GZA, t — ty) 


— GZA, ti — GOA, r — 1) (2.155) 


A sign change occurred in the last term when the direction of the time integration was 
changed from (co, —0o) to (—oo, co). The above expression contains only the first two terms 
in the S-matrix expansion, which has an infinite number of terms. 

In the expansion of the S matrix, each time integral produces one set of terms for the 
outward s leg, and another for the return leg. The nth term in the S-matrix expansion produces 
2” arrangements. All of these terms can be managed by using a matrix formulation. 


S; 

DEDI EDA 

S2 S2 
= > 
aa LDS 
S; 
FIGURE 2.12 The four Green’s functions G(s;, s2) depend upon whether the time variables (s,,s5,) are on the 
outgoing or return parts of the time loop. 
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Keldysh (1965) was the first to develop the time-loop theory for solid state applications. 
Here the theory is presented using a matrix notation suggested by Craig (1968). He expressed 
four of these Green’s functions as the elements of a 2 x 2 matrix. The self-energy terms are 
also a matrix: 


_ [E -5< 
Š = (2.156) 
=> 5; 


The actual form for the self-energy functions is discussed in the following chapters. 
For systems either in equilibrium or nonequilibrium, Dyson’s equation is most easily 
expressed by using the matrix notation: 


Člena) = Čo 22) + | dxs | daol — x3) 


—OO —OO 
x L(x; ; x4)G (x4, Xo) (2. 1 57) 


The matrix formulation comes directly from the time loop. Each s integral in the S matrix has 
an outward and return leg. Each of these legs gives a different Green’s function. So each time 
integral generates two Green’s functions. Hence the usefulness of the 2 x 2 matrix formalism. 

Some simple expressions can be obtained for the Green’s functions. First write the above 
equation in a notation where the product of two functions implies an integration over the four- 
variable dx, which condenses the same equation to 


Then the equations are iterated. The following exact expressions are derived for the equations 
obeyed by the various Green’s functions, using the same product notation: 


Geet = Geet UL + Erect Gret] 
Gady = Gaav OL + aay Gav] 
G7 = [1 + Grp XretlGo [1 + Lady Gaay] + Gree Gadv 
GS = [1 + Gree2Xret]Go [1 + Lady Gadvl + Gret>  Gadv 
G, = [1 + Grp Zrer]G [1 + LagvGadv] + Gret=7 Gacy 
E; = [1 + Gral 2 [1 + Lady Gaav] + Gree Gady (2.159) 


These equations represent multiple integrals in d?rdt. 

Equations (2.159) simplify considerably for homogeneous systems in steady state where 
the arguments of the Green’s functions and self-energies depend only upon (x, — x2). If the 
equations are Fourier transformed, then all quantities depend only upon (k, œ). The above 
equations are, after Fourier transforming, just algebraic quantities, which are easily solved. 
The time-ordered Green’s function has the same equation as given in (2.118). Some of the 
other Green’s functions are given below. The expressions are presented for nonzero 
temperature. The zero-temperature cases are found by setting the fermion occupation factor 
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np(@) = O(—@). Also define a quantity A(k, œ) = —2ImG,,,(k, @) which is called the 
spectral function: 


1 
G k, = ; = — — 
ret ( o) O — & Det Oo w Ek Rež et 
1 
Ga (K, ©) = ——————__ , 2 v= Bet 
ad O — Ek — Lady ad t 
2r 
A(k, o) = oa T= —Im2 vet > 0 
l 
L< = 2i I (k = ————— 
inp (0) ( ’ o), np(0) ebo + 1 
G~ = in,(o)A(K, œ) 
E> = -2i(1 —np)T 
G? = —i(l — np)A (2.160) 


The notation T = —Im2,,, will be used throughout, and rather often. The spectral function is 
an important quantity, which is discussed further in the next chapter. These formulas are 
incomplete without a prescription for calculating the different self-energy functions X~, È et» 
etc. They will be presented in the next chapter. 


2.10. PHOTON GREEN’S FUNCTIONS 
The interaction of charges with themselves and with the photon field was discussed in 
Sec. 1.5. For spinless particles, this interaction has the Hamiltonian in the nonrelativistic limit: 
H=y- l At] +220 at a (2.161) 
= 29m P; oi 2, | kK, fki 


The vector potential is given by the expansion 


1 1 
A, = — y` e™"A (k, À, 2.162 
aA R° uC t) ( ) 

an\ 1⁄7 o 
A,(k, À, t) = (=) E (KA) (a,e7 Ok’ + a! ek’) (2.163) 


The creation and destruction operators al. a,, obey boson statistics. Each state with wave 
vector k and polarization À has its own harmonic oscillator statistics. The vector potential 
represents the photon field. Two charges may interact via their common photon field or more 
directly through the instantaneous Coulomb interaction e,e;/r;. The division of the interaction 
between photons and Coulomb field is arbitrary—both interactions come from the same basic 
processes. The Hamiltonian (2.161) is written in the Coulomb gauge where V- A= 0. 
Another choice of gauge will result in a different division between photon and Coulomb. The 
basic forces between the particles are the same regardless of how the gauge is selected. 
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Now it is time to talk about Green’s functions. The scalar potential (r, t) has a Green’s 
function. The potential from a point charge is 


Wr) = — (2.164) 
The factor 
4 2 
v, = a (2.165) 


which has already been used for the Coulomb interaction, is in fact just the Green’s function 
of the longitudinal potential. It has no frequency dependence because it is instantaneous. 
After all, in the rules for constructing diagrams, in Sec. 2.8, it was treated as a Green’s 
function. If two electrons interacted by phonons, put in the phonon Green’s function and 
vertex, 


IM,|D(q, ©) (2.166) 


while if two electrons interact by electron—electron interactions, put in the factor v, where 
4n/q* is regarded as the Green’s function and e? as the vertex. The Coulomb interaction is 
treated as a Green’s function, on equal footing with the phonon Green’s function. 


Since v, is a Green’s function, it has a Dyson equation of the usual form: 


v(0) = 


Ug 


7 2.167 
l — Ug P (q, o) ( ) 
The factor P(q, œ) is the self-energy or polarization operator. Its properties are discussed 
extensively in Chapter 5. There is one result which can be obtained with very little effort. 
Consider the form of Maxwell’s equations in a homogeneous material with an isotropic 
dielectric constant e: 


V-B=0 
VxE=0 
EV -E = 4p 
EO 4 
VxB= E+] (2.168) 


If these equations are solved in the usual way, further equations are obtained for the scalar and 
vector potentials: 


d°r' p(r’) 
Wir) = SFER a 
5 Ee F Ar, 
A —--——~A= —-— 
V 2 ap > j (2.169) 
Here the Coulomb Green’s function is 
_ Ug 
0 = — (2.170) 
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If this result is regarded as equivalent to (2.167), it gives a formula for the dielectric function: 

e(q, œ) = 1 — v,P(q, ©) (2.171) 
On the left-hand side this result has been generalized to include the case where the dielectric 
function depends on both q and œ. Equation (2.171) will serve as the definition of the 


longitudinal dielectric function. It arises from the self-energy parts of the Coulomb potential. 
The Green’s function for the vector potential is 


D,y(k, t — t) = —i 2 (TA,(k, à, t)A,(—k, A, 2) (2.172) 
À 


where p, v are the x, y, z components. The vector potential is defined in (2.163). The sum over 
À is the sum over the two transverse polarizations of the light, while ¢, are the polarization 
vectors for each component. The free propagator at zero temperature is evaluated with the 
states |), and ọ(| as the photon vacuum: 


2ni , l 
DO(k,t—1¢) = — I E(k, NEW(HK, Ap (lTla eK + ate] 
k à 


x [ae Ow + al, e" ]lo 
= - FYE MOC — DeK ollaaa 
+ OM = ek 4 (la9,445.1)o1 
= — meine’ | 2 MA (2.173) 


and its Fourier transform is 


(0.0) 
DO (k, @) = l dte? YDA (k, t — r’) (2.174) 


—&O 
An 
= — eT 2.175 
o — ae ia HY l ) 
Next, consider the last factor in the above equation `, §,,. The unit tensor is 
Õuv = IX + YY + 2Z (2.176) 
Or, in a coordinate system in the direction of k, 


Sy = 2 bE, + kk (2.177) 


The factor >°, u6, is the unit dyadic for directions perpendicular to k. The Green’s function 
is 


4r[S,y — (k,k,/k*)] 


D© = 
HY o? — wf + ið 


(2.178) 
This expression is referred to as the photon Green's function. Keep in mind that the inter- 
action between two charges occurs via both the scalar and vector potentials. How the inter- 
action is divided between scalar and vector potentials is somewhat arbitrary and is determined 
by the gauge condition. After making this choice, the word “photon” is assigned to the vector 
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potential part. This division between photon and Coulomb is arbitrary, and both parts should 
really be viewed as arising from photons. 

The photon Green’s function also obeys a Dyson equation. Since it is a matrix quantity, 
one must be careful about the treatment of indices. Previous Dyson equations have only scalar 
quantities, where the Green’s function is defined as a series expansion. The equation for 
electrons is actually of the form 


G(p, E) = G"(—p, E) + GO (p, E)E(p, E)GQ, E) (2.179) 


Since all the quantities are scalar functions, this equation could be solved immediately for 
G(p, E) which obtains (2.114). But for the photon Green’s function, the equation is 


Dy, =DO + 2 DC msD5y (2.180) 


where 1,5(k, œ) is the self-energy function, which is now a 3 x 3 matrix. Each term in the 
equation is a function of (k, œ). However, in homogeneous materials all matrices are of the 
form 


Hv uv 2 
Ky ky 
Dw = (5. — “ep (2.181) 
kk. 
Tyy = 8, ym) + ae (2.182) 


where the factors D®, D, x“, and n® are scalars. The self-energy function ,,,,(k, œ) has the 
form shown, which is the most general dependence on k of a matrix function. Now it is 
simple to do the summations over the matrix components: 


EB — Gan + Birs, — ha) = Buy — Ryka 


and there is a scalar equation for the photon Dyson’s equation 


DO 
4n[8,,, — k,,k,/k?] 
yee (2.184) 


o? — o? — 4rr® 


The transverse photon has no dependence in its self-energy on the longitudinal part of the 
self-energy n. The equation for the photon Green’s function is (2.184). 

Actual solids are periodic, rather than homogeneous, and the matrix form of T, may be 
more complicated than (2.181). The self-energy function 1,,, will be transverse but not 
necessarily perpendicular to k. Then one must start from (2.180) and actually solve for the 
various components of D,,,. 

In a homogeneous medium with dielectric constant £, (2.169) is the equation obeyed by 
the vector potential. The double time derivative is multiplied by £. Since this yields the œ? 
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term in the Green’s function, the appropriate Green’s function in a medium with dielectric 
function € is 


AmB yy — ky ky /k?] 


= 2.185 
EY go? — we + ið l ) 


This equation for D,,, is equated with Dyson’s equation (2.184), for the dielectric function, 
which gives 


4n 
— — nC) 
This result is not very useful for crystals, since there the form Eyy(k, œ) is a tensor, and is not 


the scalar function which we have given above. The scalar result is correct in crystals in the 
limit of k — 0: 


. 4T 
lim E,y(K, @) > Spy Í — Mk, o)! (2.187) 
It is interesting to note that in the limit of k — 0, this transverse dielectric function becomes 


exactly equal to the longitudinal one (2.171) at k — 0. This identity is not obvious yet and 
will be proved in Sec. 3.7. 


PROBLEMS 


1. Show explicitly that 


ifs ELF 
x | dt, | dt, | dt TIV (tOV (V (6) 


— | dt, i dt, f dt;V(t,)V(t))V(t;) (2.188) 
0 0 0 


2. For the phonon Green’s function D(q, t — t), let V(t) be the electron-phonon interaction and 
evaluate all the n = 2 diagrams. Which are connected, and which are disconnected? Draw the Feynman 
graphs for each term. 


3. Let V(t) be the electron-phonon interaction in the expansion for the electron Green’s function 
G(p, t — t). What are the contributions from the different connected diagrams for n = 4 (two phonons). 
Just draw the graphs. Also draw all the graphs for the disconnected diagrams. 


4. Let V(t) be the electron-electron interaction in the expansion for the electron Green’s function. 
Evaluate the term for n = 1, including all equations and Feynman graphs. Also draw the connected 
diagrams for n = 2. 


5. Prove the Feynman result e4+? = e4e%e—(!/2)I4.41 which is true only if [A, B] commutes with both A 
and B. Hint: Recall that 


t 
eit) — e Hho T apl =: | dt, (el Mo rey] (2.189) 
0 
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Use the same method to prove that 

e4tB) — eT, exp | depe | (2.190) 
and evaluate the integral for s = 1. 
6. Evaluate the wave vector integrals at zero temperature for the exchange energy in (2.138). 
7. Express U,(t) = ete- and U,(t) = ee! as time-ordered exponential integrals for both 
cases t > 0 and ¢t < 0. You will need to use the operator T~}, which arranges operators in their inverse 


time ordering. 


8. Evaluate the first interaction term in Dyson’s equation (2.160) and show that it does give (2.155) 
when (x3, X4)uy = öt (x — x4) V (x3) 8yy- 


9. Show that Eq. (2.159) for G is identical to (2.118) using the result in (2.160). Self-energies obey the 
same relations as the Green’s functions: i.e., &, = X” + X<, etc. 


10. Calculate the average value of the square of an ion displacement (QF) 


h f dfg 2N,+1 
(0?) | q q 


~ 2p J (2n)4 O, 
and show that it diverges in one and two dimensions due to long wavelength fluctuations. 


(2.191) 


Chapter 3 
Nonzero Temperatures 


3.1. INTRODUCTION 


Experiments are done at nonzero temperatures. Since one goal of many-body theory is to 
explain experiments (another is to predict them), the theories should be done at nonzero 
temperatures too. It is often unnecessary if the temperature is small compared to other 
energies in the problem. But often temperature is important, and here it will be incorporated 
into Green’s functions. The nonzero temperature formalism was originated by Matsubara 
(1955). It will actually be easier to use than the zero-temperature theory of Chapter 2, so that 
the Matsubara method will be used throughout the remainder of the book. The zero- 
temperature result is always easily obtained from the nonzero-temperature result by just 
setting T = 0. 

At nonzero temperature, it is assumed there is something with a nonzero temperature. 
That is, our particle, whether electron, phonon, or spin, is interacting with a bath of other 
particles which have an average energy. The exact state of all these other particles is not 
known, since they are fluctuating between different configurations. All that is known is the 
temperature, which is related to the mean energy. 

When defining the Green’s function, one must average over all possible configurations of 
the system. A possible Green’s function for the electron is 


Trle PE C (OCho()] G.1 
Tr(e BH) l 
Cyo(t) = e Coe (3.2) 
where the symbol “Tr” denotes trace and is the summation over some complete set of states: 


Tr = $ (n|--- |n) (3.3) 


n 


The definition (3.1) would be suitable for a Green’s function and is iG~ (p; t, £t). However, it 
has one drawback which makes its use unwieldy. Usually the Hamiltonian is written 


as a part Họ that can be solved exactly and a part V which remains and becomes the 
perturbation. However, V now appears in two different places. First it is in exp(+iHt), which 
can be expanded in the usual S matrix. But it also occurs in the factor exp(—BH). There must 
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also be a perturbation expansion on the thermodynamic weighting factor. Of course, it is a 
nuisance to be doing two different expansions at once. 

The Hamiltonian enters both terms as an exponential factor. The factor of B = 1/k,T 
can be considered to be a complex time. The Matsubara (1955) method does just the 
converse; it treats time as a complex temperature. The object is to treat t and B as the real and 
imaginary parts of a complex variable, which will require only one S-matrix expansion. 

Another motivation for the Matsubara method is provided by examining the thermal 
occupation numbers for bosons (e8®s — 1)! and fermions (ec + 1)~!. Each of these can be 
expanded in a series (€, = Ep — H): 


1 1 


1 oo 

ne (Gp) = ebe +1 =3t Brno 3 œ (2n + 1)in/B — ép (3.9) 
1 1 1 2 1 

nalog) = boa] 2 aT B>. 2nin/P — Og (3.6) 


These series can be derived from a theorem which states that any meromorphic function may 
be expanded as a summation over its poles and residues at those poles. The boson occupation 
factor (eP®%a — 1)’ has poles at Oq = 2nin / B and the fermion factor (e*» + 1)~' has poles at 
Sp = (2n + lin / B. It is convenient to define the frequencies at the pole 


=(2n+1)n/B, fermions 
©, = 2nn/B, bosons (3.7) 


where the fermions have poles at odd multiples of m / B, while bosons have poles at even 
multiples, including zero. Both summations above can be written as 


Y— oS- l 


— 3.8 
n IOn — Oq n 10, — Sp (3-8) 


where for fermions we sum over only odd integers and for bosons over even integers. The 
factor 


l 


iO, — Dg 


(3.9) 


has the nature of a Green’s function. Indeed, it is the noninteracting Green’s function in the 
Matsubara method. 

In the Matsubara method, time becomes a complex quantity which is usually called 1, 
where t = it. Green’s functions are functions of t with domain 


—B<t<B (3.10) 


Fourier transform theory states that if a function f(t) is defined over the range —B < t < B, 
then its Fourier expansion is 


f(t)= a + Sla, cos ) +b,s m(=) (3.11) 
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where 
1 fP NTT 
a, == dtf (t) cos| —— (3.12) 
BJ_p B 
1 fÈ , (NTT 
b, == dtf (t) sin{ — (3.13) 
B J_. p 
Another way to write the Fourier transform is to define 
f(i@,) =3 Pla, + ibp) (3.14) 
and hence 
1 x , 
fM=_ 2 ef io) (3.15) 
B n=— 00 
1 (° | 
flio) = 5 | drf (t)e™™/P (3.16) 
—B 


There is still a further simplification which can be achieved. The boson Green’s functions 
have the additional property that 


boson: f(t) = f(t + B) when — B < t < 0 (and 0 <1+6 < B) (3.17) 
Divide the integral (3.16) into its negative and positive regions: 
1} 7° | 0 | 
flion) = 5 | daif (tje”t P? + | daf (t)ei”™/P (3.18) 
0 —B 


and change variables in the second term from t to t + B, which gives 


B 
f(io,) = >a + e”) | dxf (x)ein"/ (3.19) 


This expression has the feature that f(im,,) = 0 whenever n is an odd integer. For bosons, 


B 
fio) = | dne™f(t) 


f(t) = S3 oF ien, ) bosons (3.20) 


©, = 2nnkgT 


This result agrees with the previous observation (3.6) that boson frequencies contain only 
even integers. 
Similarly, the fermion Green’s functions will have the property that 


fermions: f(t) = —f (t + B) when —B <Tt<0 (3.21) 


The same manipulations on the integral in (3.16) give 


B 
f(io,) = =q — em | daf (t)e”™/P (3.22) 
0 
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In this case f(im,,) = 0 if n is even, while for n an odd integer, 


B l 
flio,) = | dre% f(r) 


f(t) = 55 eT Cio,) fermions (3.23) 


©, = (2n + 1)tk,T 


These equations are identical in form to those in (3.20). The only difference is whether the 
frequency œ, has even or odd integers. This pair of equations will be used often to define the 
Fourier transforms of Green’s functions. 

At this point the reader is probably overwhelmed with complex frequencies and complex 
times. One might argue that this suffering is necessary when including nonzero temperatures. 
Actually, the forecast is not as gloomy as that. You will find the Matsubara method is very 
easy to use. It is particularly good for evaluating high-order diagrams with many internal 
lines. In fact, it is a remarkably easy method to use in practice. Except for the notion of 
complex times and frequencies, it is not that much different from the zero-temperature 
Green’s functions which we defined in Chapter 2. 

A second great merit of the Matsubara method is that it leads us directly to physical 
results. In Secs. 3.7 and 3.8, some Kubo formulas are derived for the exact definitions of 
physical quantities such as the electrical conductivity, thermal conductivity, magnetic 
susceptibility, etc. In Sec. 3.3, it will be shown that these correlation functions are just 
retarded Green’s functions. Finally, it will be shown that the Matsubara Green’s functions lead 
directly to the retarded functions. Our Matsubara functions will be functions of the complex 
frequencies iw,, such as f (iw, ). The equivalent retarded function is obtained by replacing io, 
by œ + ið, where 5 is infinitesimal and i = /—1. This step is called an analytical con- 
tinuation. In practice, one just takes the formula one has derived for f(iw,), erases iœ, 
everywhere, and replaces it by œ + ið. This simple procedure yields the retarded function, 
which is needed for the physically measurable quantities. The Matsubara technique is a direct 
method of calculating the quantities which can be compared with experiment. 


3.2. MATSUBARA GREEN’S FUNCTIONS 


The electron Green’s function is defined as 


G(p, t— T’) = —(T,Cyg(t)Ch (7) (3.24) 
G(p, t — T’) = —Trle PE-N- T pt HWM C e70- E-N) 

x Chet H] (3.25) 

e PO = Tre Bb) (3.26) 


These definitions have several features and conventions which need to be explained. First, the 
bracket (---) in (3.24) has the definition implied by the equivalent equation (3.25). The 
bracket (O) on an operator O means to take the thermodynamic average, which is the trace 
over the complete set of states. Second, the Hamiltonian is now replaced by H — uN, where u 
is the chemical potential and N is the particle number operator. A grand canonical ensemble is 
used, where the number of particles is variable. This definition of the Green’s function applies 
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to a many-particle system. It can also be used very successfully for one particle in an empty 
band. In the latter case, the analytical continuation is taken as ia, — E + u + ið, and the 
chemical potential will vanish from all expressions. One is not bothered by the fact that 
Bu < 0 in one-particle systems at nonzero temperatures. 

In a many-electron system, the chemical potential is retained in the formalism. The 
analytical continuation is im, — E + id and energy is measured from the chemical potential 
(Fermi energy). The factor 7, is a t-ordering operator, which arranges operators with earliest t 
(closest to —ß) to the right. It serves the same function as the time-ordering operator in the 
zero-temperature Green’s functions. The subscript t 1s affixed to T to distinguish this operator 
from the temperature. The thermodynamic potential Q in exp(—BQ) is the usual normal- 
ization factor for a thermodynamic average. The script symbol G has been used for these 
Matsubara functions. This script will always be written to alert the reader that these are 
Green’s functions of complex time and complex frequency. 

In (3.24) the Green’s function on the left has been written as a function of the difference 
t — 7’, although the right-hand side is not obviously a function of only the difference. Now 
prove this to be the case. First, write the Green’s function for the separate cases for t > t’ and 
T<v: 


K =H — uN (3.27) 
Gp, t— T) = —O(t — V)T re PED eE Ce OR Che] 
+ O(c! — Tre PR De Eci eT Coe] (3.28) 


The sign change in the second term appears whenever two fermion operators are inter- 
changed. Next, use the theorem that the trace is unchanged by a cyclic variation of the 
operators 


Tr(4BC --- YZ) = Tr(BC - - - XYZA) (3.29) 
to move the factors of exp(—tXK) to the left. So Eq. (3.28) can be rewritten as 
G(p, t — T) = -OC — V)Tr(fe E e PEDE C eK CH 


+O — 1)Trle Ke PR DeK CT el KC (3.30) 
Next commute the exponential operators, 
et K o-B(K—-Q) — oe B(K-Q) p -TK (3.3 1) 


since they both contain the same operator K [the thermodynamic potential Q is not an 
operator but is a scalar function of B and u, as defined in (3.26)]: 


Gp, t — T) = —O(t — V)Trfe PAM) eK Cpo TE Cis] 
+O — 1)Trle PR DeK Ch OE C o] (3.32) 


The right-hand side of this equation is now a function only of the combination t — t’. The 
Green’s function can always be written as a function of this difference. It enables us to drop 
one of the time variables since it is unnecessary. An equivalent definition of the Green’s 
function is 


G(p, t) = —(T,Cyo(t)C}(0)) (3.33) 
= —Trle PRT (e* Ce Che) (3.34) 


114 Chap. 3 e Nonzero Temperatures 
Next the Green’s function is examined for t < 0 to verify that it does have the property 
asserted in (3.21): 

T<0: G(p,t) = Tr(e PR MCh eK Ce) (3.35) 


By using the cyclic property of the trace several times, the above equation can be rearranged 
into 


t<0: G(p,t) = Trebe” Ce PK CT) (3.36) 


The factor of exp(BQ) is not cycled since it is not an operator. These terms can be regrouped 
by adding exp(+BK) to the first terms to give 


T<0: GP, T) = Tr(e PEDD C se HPE CT) (3.37) 
The term on the right is —G(p, t + B) when 0 < t + B < B. These steps show that 
—B<t<0: G(p,t) = —G(p,t+ B) (3.38) 


as was asserted earlier in (3.21). The above identity proves that the Green’s function can be 
expanded in a Fourier series of the type in (3.23): 


B 
G(p, io,) = | dte'""G(p, T) (3.39) 
0 


G(p, 1) = 5 eG(p, io) (3.40) 


Equation (3.39) serves as the definition of G(p, iw, ), where iw, is always an odd multiple of 
T/B for fermions. 

The noninteracting Green’s function, or free-particle Green’s function, is obtained from 
(3.34) by using for the Hamiltonian 


H = Hy = $ &)Ch Cy, (3.41) 
po 
K = Ko = 6p Cfo Cpo (3.42) 
po 
Ép = Ep — Ht (3.43) 
The t evolution of the operators is just 
Cyo(t) = e™ Cpo ™ = e Co (3.44) 
Che(t) = ech eo = e Ch (3.45) 
which is easily derived from the Baker—Hausdorff theorem: 
1 1 
eCe™ = C + [A, C] + 5 4 IA, CH + z4, I, A, CI + ++: (3.46) 
The t dependence of the Green’s function is 
GO (p, 1) = -O(1)e™" (Cpo Cho) + O(—t)e* (Cho Cpo) (3.47) 
= —e7***{@(x)[1 — ne(E,)] — Odar Ép) (3.48) 


= —e™™"[O(1) — nr (ép) (3.49) 
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where n,(§,) is the expectation of the number operator: np(§,) = (ChoCpo)s which from 
elementary statistical mechanics has the form 


1 
np (6p) = oop 4 1 (3.50) 
It is also easy to obtain the Green’s function of frequency: 
B . B 
G(p, ion) = l dren gO (p, 1) = —(1 — ne) | dre nS») 
0 0 
Plian —§p) 
Om seo.) = a ede = 1) 551 
G (p, iW, ) ~ iO, _ Ep ( 5 ) 
The second term in the numerator may be simplified by remembering that 
ipo, = i(2n + 1)n (3.52) 
em = —] (3.53) 
which gives 
_ “Bop 4 | 
G(p, io) = Lame FD (3.54) 
io, — Sp 
1 
O (p, io,) = ———— 3.55 
gp, i0) = E (3.55) 


since it is easy to show from (3.50) that [1 — np] = 1/(e7*%» + 1). Equation (3.55) is the 
noninteracting Green’s function for electrons. 

The result for G does have the form suggested in Sec. 3.1. Temperature information is 
still contained in this expression but now only in the frequency (2n + 1)n/B. Later it is shown 
how the occupation factors reenter the expressions when the diagrams and correlation 
functions are actually evaluated. 

The phonon and photon Green’s functions are defined in the same fashion. They are 
obviously similar to each other, so only the derivation of the phonon Green’s function 1s 
presented. The photon results will be stated at the end. For phonons, in the interval 
—B <t < P the Green’s function is 


D(q, t — T) = —(T,A(q, 1)A(—q, 7/)) (3.56) 
A(q,t) = e” (aq +a je (3.57) 


Phonons have no chemical potential, since one can make an arbitrary number of them, and the 
tau dependence is governed just by the Hamiltonian. Again one can show that the right-hand 
side of (3.56) is only a function of t — 1’. It is not necessary to keep two t variables, and one 
can instead define 


D(q, 1) = —(T,A(q, )A(—4, 0)) (3.58) 
Next, examine the behavior for negative tau: 
t <0: D(q, t) = —(A(—q, 0)A(q, 1)) (3.59) 


= —Trle P-A qe" A(q)e (3.60) 
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Using cyclic permutations of the trace turns the trace into 


t<0: D(q,t) = —Tr[e e" (qe 9) 4(—-q)] (3.61) 
t<0: D(q,t) = —Trfle PY oP 4(q)e PE 4(—q)] (3.62) 

which proves 
—B<t<0: D(q,t)=D(q,t+ 8) (3.63) 


The right-hand side of the equation is the Green’s function when 0 < t+ B < B. This identity 
satisfies the general conditions asserted in (3.17) for boson correlation functions. The Fourier 
transform has the following form in (3.20): 


PB 
D(q, io,,) = l dte” D(q, T) (3.64) 
0 
1 
D(q, t) = B e '" D(q, iœ@,) (3.65) 
O, = 2ntkpT (3.66) 


Equation (3.65) provides the definition of the frequency-dependent Green’s function. 

The difference between (3.17) and (3.21) is just a sign change. The fermion functions 
have a sign change because their operators obey anti-commutation relations, while the bosons 
have no sign change because their operators obey commutation relations. Of course, this sign 
change is a result of the fundamental difference between bosons and fermions. The sign 
change is also responsible for the sign change between +1 in their two forms of thermal 
distributions: (e%» + 1)~! vs. (e8®a — 1)~!. This sign change is very important, and one has 
to keep track of it carefully in fermion problems with many operators. 

For phonons, the noninteracting or free-phonon Green’s function is obtained by taking 
H =H) = dq OAT which for the t variation of the operators yields 


a,(t) = evo aye" =e Mag (3.67) 
ai(t) = eth gi eM = eral (3.68) 
Always remember that [a,(t)]' = al (1). The noninteracting Green’s function is 
DO (q, 1) = O) llaqe + at ge™s)(a_g + at) 
— O(—1)((a_g + a$) (age ™ + at ge") (3.69) 
Capital letters are used to signify the thermal expectation value of boson number operators: 
N, = (alaq) = ng(,) = a (3.70) 
Na +1 = (aqai) (3.71) 


Averages such as (a,a,) and (alal) yield zero since they vanish for each term in the trace. 
The Green’s function of t can be written as 


DO(q, 1) = -OMIN +1) + Ne] 
— O(—1)[Nges + (Ny + Des] (3.72) 
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The Green’s function of frequency is 


p 
D(q, io,) = l dtre DO (q, 1) 
0 


Blio,—O4) 1 Bonto) — ] 
(Pa = 1) oo 63.73) 


. q . 
iM, — Dg iO, + Wg 


= -Jon + 1) 


The terms in the numerators may be simplified by noting that for bosons exp(iœ,ß) = 1 so 
that the Green’s function is 


(3.74) 


DO(g, io,) = -|o 4 ped +N eD] 


. q . 
1M, — Oq 1D, + Oq 


Using Eq. (3.70) shows that the first numerator equals —1, and the second is +1: 
DO(q, io,) = — $ + l (3.75) 
a ID, — Og iO, + Og 
20 20, 


D(q, io,) = — 5 = - (3.76) 
(ioy — o2 o2 + œ? 


This Green’s function has a simple form. It is almost identical to the zero-temperature case 
(2.72), and the only difference is the use of complex frequencies instead of real ones. Note 
that D®(q, iœ,„) is always negative. 

The photon Green’s function is also identical to its zero-temperature result, except for 
complex frequencies. The fundamental definition is 


Dy (k, 1) = — (TA, (k, à, 1)4,(—k, à, 0)) (3.77) 
À 
2 
A, (k, à, 0) = (k, A) (=) (ay, + af) (3.78) 


where the operator A, is the usual vector potential operator in (2.163). The free-photon 
Green’s function is 


4n(5, — k,k,/k*) 


D k, ia,) = 
uv ( l n) w2 + oF 


(3.79) 
which should be compared with (2.178). 

This section is ended with a comment on notation. The following three forms for the 
Green’s function are equivalent and will be used interchangeably: 


G(p, ip,) = GQ, ip) = G(p) (3.80) 
D(q, iœ) = D(q, iw) = Dg) (3.81) 


The form on the left has been used so far. In the second form, ip has been used instead of ip,,. 
They mean the same thing, since the i in ip is enough information to alert the reader that 
complex frequencies are being employed, which are always discrete. Hence the n subscript is 
redundant. In the last form, a four-vector notation p = (p, ip) is used, and the script form of G 
is sufficient to alert the reader to the use of Matsubara Green’s functions. 
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3.3. RETARDED AND ADVANCED GREEN’S FUNCTIONS 


The retarded and advanced Green’s functions were introduced in Sec. 2.9. They play an 
important role in the nonzero-temperature theory. Their properties are discussed in this 
section. Their importance comes from the fact that all measurable quantities, such as 
conductivities or susceptibilities, are actually retarded correlation functions. The goal of many 
calculations is to calculate a retarded function. There are several different ways to obtain 
them. One is to use a real time theory even at nonzero temperatures. This method is obvious 
to the beginner, but is the hardest way. The second method, which is used most often, is to 
first calculate the equivalent Matsubara function of imaginary frequency. It is shown below 
that the retarded function is obtained from the Matsubara function by simply changing iœ, to 
œ + id where ô is infinitesimal. The Matsubara function is the easiest one to calculate because 
its S-matrix expansion is simple. The retarded function is most easily found from the 
Matsubara function. 

The retarded Green’s functions may be defined for both zero and nonzero temperatures. 
The retarded Green’s function for an electron in state p is 


Galp, t — t) = iO — CChAC) + CL) CoO) 
= —iO(t — £)Tr{e PR [C, OCT.) + Ch ()C,6(0]} 68.82) 
K=H-wN, C,,(t) = e Cpo ™ (3.83) 


The brackets (- - -} indicate thermodynamic average, as is explicitly shown on the second line. 
The square brackets mean nothing in particular; they are used to group symbols together. The 
retarded Green’s function depends on real time, not tau. The tip-off for this is the i factor in 
front which belongs with all real-time Green’s functions. The Green’s function operates only 
for t > t', which makes it causal. One starts a signal at one time ” and measures it later at t. Of 
course, actual systems are causal, which is why these Green’s functions are the ones of 
physical interest. The argument of the Green’s function is an anticommutator at different 
times. In the limit that the times become equal, the anticommutator becomes unity, 


= lim{Cyg() Cho") + Cholt’)CpoO) (3.84) 


since it just becomes the usual fermion anticommutator. The plus sign in the middle of the 
two terms is an important feature for retarded Green’s functions of fermion operators. The 
left-hand side of (3.82) indicates that the retarded function depends only upon the difference 
(t — t). This feature can be shown by manipulations on the trace similar to those in the last 
section. 

For phonons, the retarded Green’s function is 


Dyet(G; t — t') = —iO(t 7 t')(A(q, t)A(—q, t) ~~ A(-q, t')A(q, t)) (3.85) 


It is very similar to (3.82) in that it is for real time, is also thermodynamically averaged, and is 
defined only for t > t. However, the sign in the middle is now minus, which corresponds to 
the fact that bosons obey commutation relations. For both electron and phonon retarded 
functions, the right-hand side can be shown to be a function only of t — ť', as is indicated in 
the argument of the Green’s function on the left-hand side of the definition. 
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Retarded Green’s functions are needed for many types of operators. These operators will 
usually be products of electron or boson operators. For example, let us define the operators 


U = Me (3.86) 
ijk 


The operator U is bilinear in the operators C;, where M; is just a matrix element. The operator 
U is regarded as having boson properties, regardless of whether both Cs are fermion or both 
boson operators; the case is excluded where one C is boson and one is fermion. U is boson 
because it acts as a composite particle. This bilinear form will be encountered quite often, 
since it is characteristic of some important operators such as the current and density operators. 

The retarded Green’s function for the operator U is denoted U and is defined as | 


Ult — t) = —iO(t — IUU E) — U NUO) (3.88) 


This definition is similar to (3.85), with the important feature that it has the minus sign in the 
center of the bracket, which is the case for all boson operators; that is, for any operator which 
is a product of bosons or an even number of fermions. 

However, an operator such as V above is considered fermion if it is a product of an odd 
number of fermions. Its retarded function is 


Valt — r) = iO — POVIE) + VIOO) (3.89) 


which now has the plus sign in the bracket. 
All these retarded functions have Fourier transforms defined by the usual convention: 


OO 
Gay(P, E) = l dte -G (p, t — t) (3.90) 
o 
Dret(G; ©) = | dt DD (q, t— t) (3.91) 
—0o 
— cO . — 
i,(®) = l dtc T(t) (3.92) 
—-C 


The advanced Green’s function for each of these 1s defined by 


Gaav (P: t — t) = IOC — NIC, OG) + GOGON (3.93) 
Dag (q, t — t) = iO(¢ — t)(A(q, HA(—q, t’) 

— A(—q, t)A(q, 2) (3.94) 

Usa(t — £) = iO — )(UMUT() — VINU) (3.95) 


The only two differences are the sign change in front and the fact that the time domain is now 
t' > t, which is just opposite of that for retarded functions. Their Fourier transforms with 
respect to frequency are defined in the usual way as in (3.91). 
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The advanced functions of energy turn out to be the complex conjugate of the corre- 
sponding retarded function. To prove this fact, first start with the advanced function, then take 
its Hermitian conjugate, and finally invert its time variables. The result is the retarded function 

Usa(t’ — t) = iO — ¢)\(UY)UT® — UTOU)) (3.96) 
Uaay(t’ — t)' = Urer(t — t’) (3.97) 


Now take the Fourier transform of both sides, 


ore) _ oo oe 
dt) Ug (t — t)! = l dt et Unavts)" 


—0O 


Ülo) = | 


—0oO 
where the last step is obtained by changing variables t, = ¢’ — t. These steps prove that 
Uret(®) = Ua (0)* (3.98) 


This result can be generalized to any of the retarded and advanced Green’s functions. It is 
sufficient to find the retarded function, since a simple complex conjugation derives the 
advanced one. 

A particular representation for these Green’s functions is introduced. This representation 
is a formal one, which is not generally useful for calculating physical quantities and deter- 
mining numbers. However, it is useful for proving theorems and in particular for relating one 
Green’s function to another. This representation uses the complete set of states |m) which are 
the exact eigenstates of K = H — uN. Usually the exact eigenvalues and eigenstates of K are 
not known. However, in principle they exist, which is sufficient for proving theorems. The 
eigenvalues of K are denoted E,, 


Kim) = E,,|m) (3.99) 


This complete set of states will be used in the thermodynamic average: the symbol Tr denotes 
trace, and the set |n) are used for this summation: 


Ult — t) = —i@(t — t)? Y (nje PLUA A UIE) 
— UŻ (t) A U@O]ln) 


The above equation has two points marked by an insertion sign “^”. In both places, a 
complete set of states is inserted which is unity, 


1 = ` |m} (m| (3.100) 
which gives 
Uralt — 1) = —iOl — tye? Ye PPL U Olm) mU) 
— (n|UT(¢)|m) (m|UOIn)] (3.101) 
The matrix elements are easy to evaluate 


(n|U(t)|m) = (nje Ue"* m) = (n|U|m) e" EEn) (3.102) 
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which gives for the retarded function 


Une(t — t) = —iO(t — te? Y e PEnfel En Em) | n U(t)|m) |? 


— e OE, Ew) (m|U)|n) [7] (3.103) 


In the second term exchange the dummy summation variables n and m so that the matrix 
elements (n|U|m) are the same in each term: 


Une(t —f)= —i®(t — te?! y HnlU |m Ze OE n) 
x [e P7 — @ Pen] 


This formula is the result for the retarded Green’s function of time. The Fourier transform is 
taken to give the frequency function, 


(0,6) 
U,«(0) = -i| eO dte? F | (n|U|m) |2eEx—Em) [eBEn — e-PEn] 
0 m,n 


e BE,, — e BE,, 


— 3.104 
o +E, —E,, + iò ( ) 


= e? © |(n|U|m)|? 


where ið is added to the frequency to ensure convergence at large times. 
The equivalent Matsubara function for the operator U is defined by a script symbol: 


U(t) = —(T,U(t)U'(0)) (3.105) 
B 

U(io,) = | dte’" U(r) (3.106) 
0 


The representation |) is also applied to this expression 


t>0: Ula) = -e° Y (nje E U(t)|m) mU (0) |n) (3.107) 
U(t) = —e®? Y | (n|U |m) eP e Er Fn) (3.108) 
The frequency transform is 
U(iw,) = -e° Y |(n|U|m) 2e?” f dre'r" e"Er-En)| 


2 oe BEn — e` BEm 
iO, +E, — E 


n n m 


= e? © |(n|U|m)| (3.109) 


where again exp(Biœ,) = 1 for bosons. This result should be compared with the retarded 
function in (3.104). They differ only in the frequencies in the energy denominator since the 


Matsubara result has iw, where the retarded function has œ + id. The Matsubara function can 
be changed to a retarded one with just this alteration: 


change 


io, > © + jg An) = U,«(0) (3.110) 
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This step is called an analytic continuation. The same method can be used to show the same 
identity for the other Green’s functions: 


change , 
iw, > O + id G(p, iW, ) — Gret(P, o) (3.111) 
change 


D(q, i = D .112 
iw, > © + ið (q, i@,) ret (4; o) (3 ) 


This relationship to the retarded function is one of the primary reasons that the Matsubara 
functions are so useful. After they are evaluated, this simple analytical continuation then 
yields the retarded function, which is the function of physical interest. The advanced func- 
tions are obtained by the analytic continuation iw, —> œ — id, which is obviously true since 
the advanced function is the complex conjugate of the retarded function. 

Another quantity of great importance is the spectral function, which is also called the 
spectral density function. It is the imaginary part of any retarded function multiplied by 2; for 
example, 


R(w) = —2Im[U,,,()] (3.113) 
B(q, @) = —2Im[D,e(q, o)] (3.1 14) 
A(p, ©) = —2Im[G,e(P, @)] (3.115) 


There is not a formal symbol for this quantity, so several are used. From the representation of 
the retarded function (3.104), the only complex part is 


l l 


$$. = P— — ind E,— E 
o + E, — Ep + iò o + E, — En inòlo + En — En) 
which gives for R(œ) 


R(@) = e° E |(n|U|m) P {e P” — e Pm }208(@ + Ep — En) 


The temperature factors can be regrouped to give 
eo PEn gy — @ PEn-En)) (3.116) 
so that 
Rœ) = 2n(1 — ee? E |(n|U|m) |?8(@ + E, — En) (3.117) 


It is now possible to write the retarded or Matsubara functions as integrals over these 
expressions: 


- (~ dæ Ræ) 
Ure) = B 2T © — w +ið 
, L% do’ R(o’ 
Ulio,) = | E © L (3.118) 


These identities follow directly from the prior results (3.104), (3.109), and (3.117). An 
expression of this form is called a Lehmann representation (1954), and it was first used in 
quantum electrodynamics. 
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Much of this book is devoted to studying electrons, and the spectral function for elec- 
trons is calculated often in a variety of problems. Here is a good place to discuss some of its 
general features. The representation for the retarded function is: 


e BE,, + e BE,, 


Gree(P, ©) = h° Coc an) ? ————_—_— 
ret (P o) € 2 (nl po lM) | o +E, — E„ + ið 


(3.119) 
This equation is similar to the boson result (3.104), the only difference is the plus sign in 
e~ BE, 4 e~BEn The plus sign follows directly from the plus sign between the two terms in the 
definition (3.82). The spectral function for the electron is then 


A(p, ©) = 21e? E |(n|Cyg|m)|?[e Pe" + e] + E, — En) (3.120) 


This quantity is absolutely positive for all values of the variables (p, œ) since the right-hand 
side of (3.120) contains only positive factors: A(p, œ) > 0. This positiveness is an important 
feature, since A(p, œ) is interpreted as a probability function. The spectral functions for 
bosons do not have this property, since they are sometimes plus and sometimes minus. One 
can show, however, that they are always plus for œ > 0 and always minus for œ < 0, which 
follows from Eq. (3.117). 

Another important feature of the electron spectral function is obtained by integrating 
over all frequencies: 


° do 
=| 57, 4P 0) (3.121) 


This important theorem, actually a sum rule, is proved by integrating the representation 
(3.120): 


° do _ _ 
| 5 Alp, ©) = eM X MnlCyo len) | le PP +e Pe] (3.122) 
—oo n,m 


This expression can be simplified by eliminating the summations over n and m. This elim- 
ination is achieved by reversing the steps by which we derived (3.104). First one relabels n 
and m in the second term: 


= e? Y ePi (nl Cpolm) (m|Ch,|n) 


+ (n|C$olm) (m|Cpol)] (3.123) 
Then one can eliminate the summation over m to give 
= eê? D eP% (nl f[Cro Cio + CheCpolln) (3.124) 
The anticommutator in brackets yields unity, which proves 
CO 
d 
| T? A(p, ©) = e°Tr(e7PK) = 1 (3.125) 
oo 2T 


The spectral function may be obtained for a noninteracting electron. In the definition (3.82) 
set Ky = Hy — N and obtain 


Colt) = eC, (3.126) 
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so that 


Gie (Pp, t — t) = 10 (¢ — fe" (Cg Ch + Cho Cpa) (3.127) 


= —iO(t — re (3.128) 
The Fourier transform is 


l 


p 


ret 


(3.129) 


The factor of ið was inserted for convergence. It has one sign 6 > 0, even in many electron 
systems with a Fermi surface. The retarded functions do not have 6, changing sign at the 
Fermi surface, which makes them easier to use than the zero-temperature Green’s functions 
introduced in Chapter 2. 

The spectral function for the noninteracting Green’s function is 


A "(p, E) = 2n8(E — p) (3.130) 


It is just a delta function. The spectral function A(p, E) is interpreted as a probability function. 
It is the probability that an electron has momentum p and energy E. For a free, or non- 
interacting, particle then E = €, so the probability distribution is a delta function: Here there 
is only one value E for each €, and vice versa. Equation (3.130) is plotted graphically in Fig. 
3.1. For a fixed value of p, a plot of the spectral function vs. E is a sharp delta function (here 
given a small width to aid the eye) at E =&,. When A(p, E) is computed for interacting 
systems, the broad distribution is found as the one labeled A in Fig. 3.1. There is a band of E 
values for each p, which is not surprising. When the electron scatters, it has a nonzero mean 
free path, and there is some uncertainty in its momentum or energy or both. So p and E are 
treated as separate variables and both are summed over when evaluating physical quantities. 
The spectral function A(p, E) appears in these summations and gives the proper probability 
weighting between these variables. 

Another quantity to evaluate, for an interacting electron system, is the number of 
electrons in a momentum state p, which is 


Ny = (Cic Cpo) (3.131) 
Z PO 
g AN 
Ep w 


FIGURE 3.1 The noninteracting spectral function A is represented by a delta function, while the interacting 
spectral function A often has a nonzero width. 
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For a noninteracting electron system, the number is trivially given by (3.50). In the interacting 
system, again introduce the representation 


n, = e°? L (mle PS Chol) (al Cpl) (3.132) 


=e Le” "|(n|Cyglm) |? (3.133) 


This result n, should be compared with A(p, E) in (3.120). The two expressions differ in the 
factors 
(ePEn 4 e Pom S(E + E, — Ep) = e Prr (e PEE) 4 1)8(E + E, — En) 
= e` PEn (eE + 1)8(E + E, — E„) 


Consider the integral using (3.120) 


mee p AP E) = =e de "|(n|Cyglm)|" = np (3.134) 
® dE 
Ny = | zp EDA, E) (3.135) 
—OO 


The factor np(E) is the fermion occupation factor at nonzero temperatures. Remember that the 
energy E is measured with respect to the chemical potential. The thermal occupation prob- 
ability is determined by the energy E of the particles. The number of particles in a state p is 
obtained by summing over all energies E, weighted by the spectral function which gives the 
probability that a particle in state p has energy E, and also by multiplying by the thermal 
occupation factor np(E). This reasonable expression provides further examples of the use of 
the spectral function. In the limit of no interactions, then A(p, E) becomes the noninteracting 
spectral function (3.130), and Eq. (3.50) is recovered again. 

For phonons, the average number of phonons in a state q is Ny = (aha 
similarly show that 


q)» and one can 


° do 
2N,+1= (Al Aq) = | Sentosa, o) (3.136) 


The phonon spectral function B(q, œ) was defined earlier in (3.114). The factor ng(œ@)B(q, œ) 
is always positive for phonons and could serve as the temperature-dependent probability of 
having phonons with q and œ. The noninteracting phonon spectral function is 


BO (q, ) = 2n[5(o — Wq) — 0(@ + Oy)! (3.137) 


The retarded Green’s functions will be calculated from the Matsubara functions. In the next 
section it is shown that the Matsubara functions have a Dyson equation of the form 


l 
— Ëp — 2 (p, ip„) 
—20 


q 
3, 
oz + 04 + 20,P(q, io) (3.139) 


G(p, ipn) = (3.138) 


D(q, io,,) = 
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The self-energy functions È and P will be calculated according to rules described later. Define 
the retarded self-energies according to (3.111): 


change 
ip, > E + ið 


change 


2(p, iDn) —> Lret(D, E) = Red, (Pp, E) + iImd,.(p, E) 


P(q,i . 
i, >ot+ ið (q, iW) - Pret(G, o) (3 140) 


The retarded Green’s function will also have a Dyson equation 


l 


Gret(P, E) = = aR 
ret (P ) E + IÒ — Ép — Leet(P; E) 


(3.141) 
as derived from (3.111). The spectral function for the electron is rewritten in terms of the 
retarded self-energies, 


—2Im2,e(P, E) 
[E 7 Ép — ReÈ elp, E)? + [Im2,.:(P, E)? 


where ImÈ < 0 so that A > 0. One way to obtain the spectral function is to evaluate the self- 
energies. The method for doing this is described in the next section. 

The formal distinctions between the Matsubara, retarded, and advanced Green’s func- 
tions are best understood by some simple examples. There are some simple functions which 
have the correct analytical properties. For example, consider a self-energy operator which has 
the following functional form, 


A(p, E) = (3.142) 


X(p, Z) = Cln[ f(p) — Z] (3.143) 


where Z is a complex variable representing the frequency. Take C as a constant and f (p) as 
some function of momentum. The Matsubara self-energy is evaluated at the points ip,: 


X(P, ipa) = Clnl f (p) — ppl (3.144) 


The analytic continuation ip, — E + ið to the real axis has the following values. For the 
retarded function, ip, > E + ið 


Let(P, E) = C In| f(p) — E| — inCOlE — f(p)] (3.145) 
while for the advanced function, ip, => E — ið, 
Laav(p, E) = C In| f(p) — El + inCOlE -f p) (3.146) 


These two self-energies differ in the region E > f(p), because their imaginary parts have the 
opposite sign. This difference agrees with our general theorem that 


Gre(p, E) = Gaay(p, E) (3.147) 
which also implies that 
Lret(P, EY = Žad (P. E) (3. 148) 


This functional behavior is shown in Fig. 3.2. There is a branch cut on the real axis for 
E > f(p). This branch cut just expresses the fact that In( f — Z) is not a continuous function 
of Z across the real axis for E > f, since the imaginary part changes sign. 

Another example which has similar analytical properties is 


X(p, Z) = CL f(p) — Z]'” (3.149) 
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fip) lim 5>0 


FIGURE 3.2 


This function also has a branch cut for E > f(p), with Im < 0 above the branch cut and 


Im = > 0 below. In fact, a branch cut is a necessary feature whenever Im 2 4 0. Whenever 
Im x Æ 0, then 


X(p, E + iô) £ X(p, E — iô) (3.150) 


which requires a branch cut in the analytic function. When self-energy functions are eval- 
uated, they are often given by logarithmic or square root functions. 

When a branch cut occurs and Im + 0, then the spectral function is given by (3.142). In 
frequency regions where Im È = 0 and there is no branch cut, then take the limit of Im £ — 0 
and obtain 


Im2=0: A(p, E) = 2nd(E — bp — Re Èe, E) (3.151) 
Here the spectral function is again a delta function, but now the real part of the self-energy 


may be nonzero, and usually is, so that it affects the spectral function. Denote by E(p) the 
solution to the equation 


E(p) — u = čp + Re 2re(p, E(P) — y) (3.152) 


Assume that there is a problem in which (3.152) is satisfied when Im È = 0. Recall that if 
g(x) = 0 at x = Xp, then delta functions have the property 


d[g(x)] = a (3.153) 
and the spectral function is written as 
A(p, E) = 2nZ(p)o(E — EP) + p) (3.154) 
Z(p) = z l (3.155) 
i — zp ei. E) eet) 


The factor Z(p) is called a renormalization factor. If we recall the facts that (1) A(p, £) > 0 
and (2) f dE/(2n)A(p, E) = 1, then it is easy to show that Z(p) < 1. The strength of the delta 
function peak is always less than or equal to unity. An example is illustrated in Fig. 3.3. The 


A(p,w) 


a) 


aw 
2 


Epu w 
FIGURE 3.3 The spectral function A(p, œ) will have a delta function peak whenever ImZ = 0, which is shown here 


at œ = E, — p. The spectral function also has values in regions where ImÈ # 0, as shown by the crosshatched 
distribution. 
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spectral function is shown with a sharp delta function peak at œ = E, — and a continuous 
spectra in the region @, < œ < œ. The latter region is where Im È Æ 0. Since the total 
integrated area under the entire spectra is unity, the existence of any areas where Im È Æ 0 
implies that the renormalization factor Z(p) for the delta function is less than unity. 

Equation (3.152) may be used to define the effective mass. Assume that the noninter- 
acting states are free particles, so that 


p? 
Sp =>, N= & hH (3.156) 


Furthermore, assume at low momentum that E, = E(p) varies quadratically with momentum, 


2 
P 
E, = Eo try 


+ O(p*) (3.157) 


The proportionality constant is the inverse effective mass m* 


OE 
“a (3.158) 
m* dE, 
and the derivative of (3.152) is 
Oy _ i; 1+ Red E)+ ? Re X( E) Fp 3.159 
de 0\ | de, P| Lae | Ge, (3-159) 


The last term on the right contains the factor (3.158), which is brought to the left to finally 
obtain 


m f + (0/d€, )Re X(p, Eo — 2] (3.160) 


m* — «,-+0| 1 — (0/3Eo)Re E, Ey — p) 


This formula will be used frequently to obtain the effective mass from self-energy calcula- 
tions. 


3.4. DYSON’S EQUATION 


The Matsubara Green’s functions are evaluated by the same method of Feynman diagram 
techniques that were introduced in Chapter 2 for the case of zero temperature. These methods 
are slightly modified to account for complex times and frequencies, but otherwise the S- 
matrix expansion appears to be very similar. The ideas behind expanding the S matrix are 
presented, and Dyson’s equation is rederived for the Matsubara Green’s functions. 

Consider the case of the electron Green’s function: 


G(p, t) = —e°°Tr[e PX T.(e* Ce * Chg] (3.161) 
e BO — Ty(e7 PK) (3.162) 
Again consider the general case where 
K = Ko + V = Ho- uN +y (3.163) 
H = Hy +V (3.164) 
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where Kọ is a problem which can be solved, so that we know its complete set of states. The 
Hamiltonians under consideration usually have the property that they commute with the 
number operator: 


[H), N] = 0 (3.165) 
[H, N] = 0 (3.166) 


In this case, simultaneous eigenstates of H, and N can be defined and also of H and N. Call 
Ho or Ky the noninteracting problem, and V is the perturbation whose effects are being 
evaluated. Consider the operators in the interaction representation 


U(t) = ee" (3.167) 
U`! (t) = ee% (3.168) 


A caret is put above operators to denote their t dependence with respect to the noninteracting 
operators: 


olt) = eC, eM = e™?C o (3.169) 
The Green’s function in (3.161) is written for t > 0 as 
G(p, t) = —e”°Tr[e ’®. (ere PE) (e eE) Ce) 
x (ee) Ch] 
Tr(e PK UBU- (Cp UC o] 
Tre PA U(B)] 
The thermodynamic potential in the denominator has been replaced by the equivalent factor 
e~ BO — Tre PK) = Tr[ePh0(ePho PR) 
= Tr[e > U(B)] (3.171) 


(3.170) 


A similar substitution has been made on the factor of exp(—BK) in the numerator. 
The operator U(t) can be solved in terms of t-ordered products. Consider the derivative 


0 
OT 


which can be expressed in the interaction representation as 


U(t) = eE (Ko — K)e *® = —e™o Ve (3.172) 


Ula) = (EM yeee) = HU) (3.173) 


This equation for U(t) may be solved, at least formally, by repeated integrations and using 
U(0) = 1: | 


U(t)=1-— | dt,V(t,)U(1,) (3.174) 


=1-| daad | a | dt V(t) V(t))U(t>) 
0 0 0 


= ¥(-1)" | ESIE dr, P(t) P(t,) (3.175) 
n=0 0 0 
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The summation can be expressed as an ordered product, 


ye J" 


U(t) = Du 


al dt, + | dt,[T.V(t,)---V(t,)] (3.176) 
n=0 


= T, l- | dt, re) (3.177) 
0 


and finally as a t-ordered exponential integral. The strict definition of (3.177) is just (3.176). 
Next consider the definition 


Ta a 
S(T1, T2) = T, exp | dta (3.178) 
Ti 
S(t) = U(t) (3.179) 
It is easy to prove the following operator identities: 
S(t, T1) = U(t2)U"(t) (3.180) 
S(T3, T2)S(T2, 11) = S(t3, T1) (3.181) 


In this notation the Green’s function (3.170) may be rewritten for t > 0 as 
TrleP7,S(B, )Êpo Sho (0) 
Tr[e~PAoS(B)} 


Using the above properties of S($, t) and taking advantage of the freedom to rearrange terms 
within the t-ordering operator, the numerator may be expressed as 


Tr[eP* T.S(B)Cyg(t)Cho(0)] 


G(p, 1) = — (3.182) 


N=- 3.183 
From now on the trace over exp(—BXK ) is written with a subscript 0, 
Tre PX] = 5 (8) (3.184) 
where 9 is any operator. The Green’s function is 
T,S(B)Cog(t)Cho(0 
G(p, t) —_— of T (B) po(T) po( )) (3.185) 


9 (S(B)) 


We continue to write a caret over Ch(0). Although it has no t dependence when t = 0, the 
t = 0 label is important for ordering the operator. Equation (3.183) has been proved only for 
the case t > 0, but this case is the only one of interest. The Tr operator in (3.183) really 
means (3. ne The S matrix is divided into parts, which are ordered with respect to the other 
operators Cpo(T) and Ch(0). This form of the Green’s function is similar to the zero- 
temperature result (2.50). 

The Green’s function is evaluated, at least formally, by expanding the S matrix in the 
numerator: 


o (TSB), (1$ (0) 


-5E y dr,- f dr, (T, 0P) PaO) 68186) 


n=0 n! 
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Each of the nth terms are evaluated by applying Wick’s theorem to the brackets and thereby 
expressing the brackets as combinations of the non-interacting Green’s functions G® and 
D®). Wick’s theorem is applied as before to give, for example, 


(TCs (t) Che (t1) Cy's(t1)Cho(0)]) 
= ôp-kõp=x Oso FD, t= 11)G(p, Tı) — k-k” g” (p, T) (3.187) 


Wick’s theorem has a double meaning for Green’s functions at nonzero temperatures. The first 
meaning is just the pairing feature: start with a state |n} (in the trace) and operate on this state 
by some C,. In order to get back to (n| at the other side of the trace, there must be another 
operator of the form ci appearing in the product of operators. The pairing was used in Wick’s 
theorem for zero temperatures, and the same use is made here. Whenever there is a bracket 
with M creation and M destruction operators, each of the creation operators must correspond 
to the same state as one of the destruction operators and vice versa. 

The other feature of Wick’s theorem concerns the thermodynamic average. The brackets 
o(---) mean the average in (3.184), and usually 


» (AB) # o(4)o(B) (3.188) 


where A and B are two arbitrary operators. Usually, the product of averages is not equal to the 
average of products. However, in Wick’s theorem, we do just that: a product of operators is 
paired up, and each pair is thermodynamically averaged separately. This procedure does give 
the right answer as long as Họ is only bilinear in the operators. Any errors that are made 
vanish in the limit of infinite volume. One exception is for macroscopic quantum states as 
occur in superfluids. As an example, consider the evaluation of 


W = (z ; È Ch OC O E Cs oal) (3.189) 
p,s k,s’ 


where the operators refer to fermions. First, do it exactly. The combination 
(Ch s(1)Cp,s(1)) = np (3.190) 


is the number operator. It is assumed to commute with the Hamiltonian and has no Tt 
dependence. In the thermodynamic average, the cases of p = k, s =s must be treated 
separately from p Æ k or s’ Æ s. The occupation numbers are assumed to be independent of 
the spin index: 


W = E (Ch Co s Cf y Crs) + Zlo lC} sCp sC} sCo.s) 
p.s 


s,s’ p#k 
+ (Ch sCp, Koi —sCp,—s)] (3.191) 
=> DE nk + oly +n] (3.192) 
s,s’ p#k 
2 
= (Zr) +3 n [l — ny] (3.193) 
P,s Ps 


The average of (Ch, sCp, Ch, sCp,s) gives only one factor of n, since the occupation number 
c$ sCp,s is either zero or one. "Squaring it still gives zero or one. However, the average 


NCC sCp, Kon —sCp,—s)} gives n? since the two spin states are averaged independently. 
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The same problem can be solved by the application of Wick’s theorem which gives 


2 
W = (=) - Gp, t— VG (p, v — 12) (3.194) 
P.s Ps 

Recall that G(p, t) = —[O(t) — n,|exp(—&,t). Then the above combination of Green’s 
functions gives (t = 0) 


Gp, 1G p, —1) = [O(t) — n, J[O(—1) — np] = —n, [1 — np] 


and this method yields the same answer (3.193) as the exact method. 

Diagrams in the S-matrix expansion are classified as connected or disconnected 
according to the conventions discussed in Sec. 2.6. Only connected diagrams are retained, 
since the disconnected diagrams are canceled by the vacuum polarization diagrams. For 
Matsubara Green’s functions, the vacuum polarization terms come from the denominator: 


-Bo œ (—1)” p° P a a 
TETTO l da | dro (T, Pa PO) (3.195) 


n=0 0 0 


The expansion of this quantity produces a series of diagrams which just cancel the discon- 
nected parts of the expansion in (3.186). In the zero-temperature case of Chapter 2, it was 
found that each vacuum polarization graph gave infinity, which was perhaps not a worry since 
they canceled out of the answer. In the Matsubara formalism, the vacuum polarization terms 
are all nonzero. The “time” integrals, which are now qt integrals, are only over the range 
0 <t < B and the extra integration does not diverge. The Matsubara formalism eliminates 
this one divergence, which is another small advantage of the method. 

The summation in (3.195) actually evaluates a useful quantity: the thermodynamic 
potential Q. One might actually wish to calculate it. The method of evaluation is presented in 
Sec. 3.6, and is called a linked cluster expansion. 

The Matsubara Green’s function can be reduced to an evaluation of all connected, 
different diagrams: 


00 B B n A a a 
G(p, 1) = — LOD | dt: ef dipo (T, Cpo DV 0) VNO) (3.196) 


where only the different, disconnected diagrams are evaluated. Each diagram is evaluated as a 
function of t, and the Green’s function of energy is found from the Fourier transform: 


B 
G(p, ip„) = l dte”:"G(p, T) (3.197) 
0 


and p, = (2n + 1)nkgT. The terms in the series (3.196) yield self-energy diagrams, which 
may be collected into Dyson equations: 


GD, Pn) 


Gp, ip,) = —. > (3.198) 
1—G(p, ip, )E(P, ip,) 
DOg, iW, 
D(q, iœ) = — P ao (3.199) 
l- D (q, iw, I1(q, iO) 
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FIGURE 3.4 


As an example, consider the basic electron self-energy contribution from the electron—phonon 
interaction shown in Fig. 3.4. This self-energy was evaluated in (2.111) using zero- 
temperature methods. Here it is evaluated again using Matsubara methods. The interaction is 


l 
V =X MA 3 Chia oCpo (3.200) 
q oO 


The n = 2 term in the S-matrix expansion is 


1 B B A A 
G> = — — > M,My | dt | dt o(TAg(t )Ag(T2)) 
V aq’ 0 0 


x do o (TeCps tC rg s(t) Cu, sO y g (2) 


k,k’,s,s’ 
x Cy g(%)CTy) (3.201) 
The factor of 
o (TAa) = q-q DOA, T1 — T2) (3.202) 


gives the phonon Green’s function. Applying Wick’s theorem to the electron bracket yields six 
terms, but there is only one different disconnected term. It gives the contribution: 


1 B B 
GP.) = -ipm | dr, l dt,D(q, t — t) 
V aq’ 0 0 
x G(p, t= 7) OP + q; T1 — 1G, 12) (3.203) 


The t integrals may be performed easily, since the t dependence of the non-interacting 
Green’s functions was derived earlier in (3.49) and (3.72). However, this step is rarely 
performed. Instead, it is easier to take the Fourier transform in (3.197). The t integrals are 
done by using the expansion given in (3.39) and (3.65), such as 


1 , 
DO(q, Tt] — T3) — B> eh- q, i©,) (3.204) 
O, 


1 , 
GOP, t= t) = p E PEGO, ipn) (3.205) 
Pm 
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so that 


1 
G(p, Pn) =- Ma yD, io,,)G, ipy) 
q 


1 1i 
m,n’ ,n” nl” 


B B B 
x Gp — q, ip n)G(p, pw) | ar | dt | dt 
0 0 0 


x explip,T — 1@,,(T, — T2) — Py (T — 7) 
— ipw (ti — T2) — Py T2] (3.206) 


All three t integrals are of the form 


1 f | l Bip — 
— dre Pn-Pm) — —— olin Pm) __ 1) 
Bo iB, pa) 
1 nnm 
= nGa WE ) — 1) = Onn (3.207) 
which are written in the shorthand notation 
1p 
5 dre”? Pr) — on, =p, (3.208) 
0 
O 
zl dre P” =P — Om) = Dy =D ql + Om (3.209) 
0 
1, 
5 | dels PrP" +n) =S, ay, (3.210) 


Combining these results yields the final result 
Go(P, Pp) = G” (P, ip) E(B, ip,) (3.211) 


where the one-phonon self-energy term is now 


ZOP, ip,) = -5 E MDO (q, 10,,)9(P — q, ip, — 10n) (3.212) 
Or 

The self-energy (3.212) in the Matsubara notation is similar to (2.111). The only difference is 
the use of complex frequencies rather than real ones. Also note there is frequency conser- 
vation in this expression. An electron starts with frequency ip„, and it emits or absorbs a 
phonon with frequency i@p, so the electron goes to an intermediate state with frequency 
ip, — i@„. Again the summation is taken over internal variables, in this case the momentum q 
and the frequency iw,,. In general, the complex frequency is conserved at each vertex in the 
Feynman diagram. This conservation does maintain the oddness of fermion frequencies and 
the evenness for bosons. For example, the electron in (3.212) has the frequency ip, — 1@,. 
Now ip, is an odd integer, since it was associated with the original electron line, while iœ, is 
an even integer, since it was associated with the phonon. But an odd plus an even is odd, 
which preserves the oddness of electron frequencies. 

The exact Matsubara Green’s function may be obtained by finding the exact self- 
energies. The exact solution is rarely possible, and usually one is content to write down a few 
diagrams and evaluate the first few terms in the expansion for X(p, ip,,). To obtain these terms, 
draw the Feynman diagrams, conserve momentum and frequency at each vertex, and then 
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integrate or sum over internal variables. All that is needed now are some rules for 
constructing diagrams: 


. With each internal electron line, associate a quantity gO (p, ip,„). 

. With each internal phonon line, associate a quantity MjD(q, ia,,). 

. With each internal Coulomb line, associate a quantity v, = 4ne /q’. 

. Conserve momentum and complex frequency at each vertex. Keep in mind that 
fermion frequencies are odd integers (2n + 1)mk,T and boson frequencies are even 
integers 2nnkgT. Their oddness and evenness will be maintained in the energy 
conservation. 

5. Sum over internal degrees of freedom: momentum and frequency. Internal variables 

are all those except the (p, ip,,) of the self-energy. 

6. Multiply the expression by 


BR U Neme 


(DE 25 + DË 


ayy" (3.213) 


where F is the number of closed Fermion loops. The (2S + 1) factor is a summation 
over spin degrees of freedom, and 2S + 1 = 2 for electrons. The integer m is the 
order of the diagram, as defined earlier in Sec. 2.8. 


Several examples are presented. The basic fermion loop in Fig. 3.5(a) occurs in the phonon 
self-energy and also the Coulomb self-energy. It is 


2 
P(g, iœ) = By YG, Dn )GO(P + q, Pm + ion) (3.214) 
q,iP m 


In Fig. 3.5(b) a phonon line has been added between the two electron lines, which is called a 
vertex correction. This diagram gives, according to the rules, 


2 
P(g, i0) = —— X gO, Pp )G(P + q, ipm + ion) 

(BV) ai, 

x MED (k, ik, )GO(p +k + q, ip, + io,, + ik,) 
kik, 

x Gp +k, ip, + ik,) (3.215) 
(a) (b) 

LP, +e Wy, iP tiw, P eiw tiul 

tw, twn tu, 

iP, tP, tFn * twf 


FIGURE 3.5 
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3.5. FREQUENCY SUMMATIONS 


When using the Matsubara Green’s functions, one must often evaluate frequency 
summations over combinations of unperturbed Green’s functions. The technique for evalu- 
ating these summations is discussed for both cases of unperturbed functions and also Green’s 
functions with self-energies. First is presented a table of results for combinations which often 
occur. Their derivation is later: 


Na + np (p) 
a a 
a Fl —nr(&p) 


l 
Bp E DO (q, 0m) GOP, ip, + iOm) = 


TA T (3.216) 
n p q 
l = 
a G(p, ip, )G(k, ip, + i@,,) = oe (3.217) 
l Vin ; Ore ia o; 1 — npp) — nrl) 
p> G(p, ip, ) GO (k, ion — ip,) = ae =, TE (3.218) 
LS gO, ip,) = Mey) (3.219) 


n 


The combination (3.216) occurs in the electron self-energy (3.212) from the electron-phonon 
interaction, while the combination (3.217) is found in the basic polarization diagram (3.214). 
The other combinations will be encountered later. 

First consider the summation over a boson series, where the summation is over even 
integer combinations ©„ = 2nmkgT. For example, consider (3.216): 


1 20 1 
E ee (3.220) 
Bah + aap, tio, G 
First, write this equation as 
1 
D= -gS On) (3.221) 


where f (iœ) is the product of Green’s functions in (3.220). This summation is evaluated by a 
contour integration. The integral has the form 


Ta gm E Ff (z)np(z) (3.222) 


where the contour is a large circle of radius R in the limit as R —> oo. The function n,(z) is 
chosen to generate poles at the points iœ,„ for all even integer m. The function which does this 
is 


] 


The poles of ng(z) were discussed earlier at the beginning of the chapter, in (3.6). There are 
poles at the points i2nmk,T for all positive and negative integer m and m = 0. The residue at 
these poles is 1/B. In Fig. 3.6, these poles are shown as X marks which are evenly spaced on 
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FIGURE 3.6 


the vertical axis. The large circle is the contour of integration, which is a circle of radius R. 
The function f(z) is 


20, 1 


— 3.224 
z? — wip, tZ — Ëp ( ) 


fe) = 


It has poles at the points +œ, which originate from the phonon Green’s function. Another 
pole is at the point €, — ip,, and this pole comes from the electron Green’s function. The 
poles, and their residues, of the integral J in (3.222) are 


1 
Z„ = i2nmkgT, R = B fiOn) 


R Aa 
Z. = 0 = — 
' a ' ID, E Ép + Wy 
N,+1 
Z} = —0Og R, = a 
IPn — Ép — Wg 
, —20qF (Ép) 
(ip, ~ Ep) ~~ Wg 
where our notation is that N, = n,(@,). In the last residue, since exp(ip,,B) = —1 then 
l l 
ng(6p — iPr) = BGP) 1 ea] —nr(5p) (3.226) 
The last residue may be rewritten in partial fractions 
n n 
R; —- rp —- F(Sp) (3.227) 
iP, — Sp + Oq Pn — Sp — Oq 
The integral is evaluated by adding all these residues: 
i Na +n N, +1—n 
I= -F flion) +- q F(p) q r(Sp) 
m Py — Sp + Oq Dn — Sp — %q 
The integral vanishes (J = 0) in the limit that R —> 0 which gives the final result, 
N, +n Nvtl—-n 
S= Na + nrp) 4 Nq + 1 — nr(&p) (3.228) 


iP, — Sp + Oq iP, — Sp — Oq 
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which is (3.216). The method of evaluating these boson series is quite simple. To evaluate a 
series such as (3.221) one just finds all the simple poles of f(z) which are at the points z; with 
residues r; and 


S = ) rjnz(z;) (3.229) 
J 
The same procedure is used to evaluate fermion series. Then the summations are of the form 
| , 
S=--> fipn) (3.230) 


where p„, = (2n + 1)nkgT contains odd integers. One constructs the same contour integral as 
in (3.222). Now the function np(z) is used since it has poles at the points ip, with residues 
—1/ß. The integral again is J = 0, so that the summation is over the residues r; at the poles z; 


of f(z): 
S=-—) rn,(z;) (3.231) 
T 


The minus sign in front occurs because the residue of the fermion np (z) is (—1/B), whereas it 
is (1/B) for the boson ng(z). 

As an example of summing a fermion series, again consider (3.216). The summation 
variable is changed to p,, = p, + @,, so that the summation is now 


l 
S=-5} DQ, ipw — ip,)9® P, ipx) (3.232) 
1 20g 1 
— iy (3.233) 
p 2 Pw — Pn) + @2 iPr — Sp 
20 1 
SO =——_5 (3.234) 


Py contains odd integers, since it is a fermion frequency. The function f(z) has the following 
poles and residues: 


2WgnF(Sp) 
(Cp — PY — 0 
—np(ip, — Oq) 
Pn — Ep — O 


— +np (ip, + Wg) 


Z = Sp Ry = 


Z2 = IP, — Og» R, = 


Z,=ip,+®, R=- (3.235) 
° P a ° IPn — Ép + Og 
The last two thermal factors may be simplified: 
, 1 l 
np (ip, + Wg) = —Ng (3.237) 


The final result is obtained, as in (3.231), by adding these three residues, which gives again 
the same answer (3.228). 
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Three summations were listed at the beginning of the section. The first has been derived 
twice. The next two, (3.217) and (3.218), are left as an exercise for the student. The fourth 
result (3.219) is now explained 


5 > GC. ipa) = eG) 


The left-hand side of this equation is most familiar as the Fourier transform of the Matsubara 
Green’s function: 


Gp, 1) = 3 eg (p, ip,) = —o(T,Cyo(t)Cho(0)) 


The result (3.219) is just the limit as t — 0. This limit is ambiguous, since a different result is 
obtained if t = 0 is approached from the positive or negative direction: 


gp, T= 0”) = —0 (Cpo C}o) = —[1 — nr(5p)] (3.238) 
G(p, t = 07) =9(Ch Cho) = ne(Ep) (3.239) 


The result (3.219) is merely the convention of adopting the limit t = 07. This choice has been 
discussed several times before. The same result at zero temperature was expressed in (2.86). It 
is the convention that two operators at equal time are taken in the order which gives the 
number operator: 


a o(T:Cpo(1)C}o(1)) = o(Cio Cpo) = np (Ëp) (3.240) 


The Matsubara sum in the preceding example was very easy. Summations which contain only 
G and D® always cause the contour integral J to have just poles. If the summation 
contained G, instead of go. then the J integral has branch cuts. Next is shown how to treat 
this more difficult case. 

Consider the example of evaluating the summation (3.216) except that the Green’s 
function for the electron contains some self-energy terms: 


Dy _ Ép — X(p, iD») (3-241) 


GP, ipa) = 
Letting ip, = z then the above function has branch cuts along the real z axis. These branch 
cuts arise because of the self-energies X(p, z), as discussed in Sec. 3.3. The contour integrals 
used to evaluate the frequency summations become more complicated because of the branch 
cuts. 
The most general possibility when evaluating a summation over Matsubara frequencies 
is to have all the Green’s functions fully dressed. Then it is easiest to proceed by first 
expressing all the Green’s functions in the Lehmann representation. For example, evaluate 


S = -5 E DQ, idy GPP, + i, (3.242) 


Express each Green’s function as a frequency integral over its respective spectral functions: 


© do Bq, 0) 


D(q, i®,,) = . 
a im) [ an io, — o (3.243) 
© de A(p+q, 6) 
9 Fn m) — 7: ʻ — 3.24 
AP +4: ipn + iOm) eee (3.244) 
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and the summation S becomes 


CO 


° do dé l 
s=| Paq.c)| Tap +a Slp 08) 


, l l l Np(@) +np(£) 
S,(ip,, ®, €) = -=Y —— m n E 
Pn» ©, £) Bi, op, tio, e ip, +©— E€ 

oe dw oe dé nz(@) + np(€) 

S = — B(q, —A , E) — 

S (q D| s (p + q, £) ip, +o —€ 


The summation S)(ip,,, ©, €) is now the easy type which is evaluated using noninteracting 
Green’s functions. The final result provides for the most general case of fully interacting 
Green’s functions. In general, any frequency summation can be done by expressing all the 
Green’s functions in their Lehmann representation, which results in a summation of the easy 
type listed in (3.216) to (3.219). 

The result (3.216) has now been derived in this section by several different methods. 
Now we explain the physics behind this result. It is a bit subtle. The virtue of the Matsubara 
method is that these subtleties are automatically handled correctly. It is a machinery which 
cranks out the correct thermal occupation factors np and N, even in complicated cases. 

The summation (3.216) arises from the electron-phonon interaction and is a phonon 
contribution to the self-energy of the electron. From the rules for constructing diagrams, the 
one-phonon self-energy of the electron is 


l N,tl—-n7 N,+n 
Lp, ip,) = — YM? Nq +1 = Mpa) 4 Ng + nr pra) (3.245) 
vq Pn — pta Oq IPn — Sptq + Oq 
This result is similar to what is obtained from second-order perturbation theory. The standard 
quantum mechanical expression for this quantity is 


42 
AE®) = y lieli (3.246) 
T EEr 

where |i) is the initial state of the system, and the summation |/) is over possible intermediate 
states. It is assumed that the initial state is an electron in state p with energy €p. The electron- 
phonon interaction has two terms. One of these is M,C} +q.6Cpo4 q; which describes a 
process where an electron scatters from p to p + q while creating a phonon of momentum 


—q. This contribution from second-order perturbation theory is 


lom (Na + DU — Nr(Spiq)] 
va 4 Ep — pig — Oq 


(3.247) 


The factor (N,+1) comes from the phonon creation operator, a! qln_q) = 
Jn_q + llna + 1). The thermal average is used for n_,, which is Ng. Similarly, the factor 
[1 — np] is the probability that the electron state p + q is empty, so that the operator ci +40 
can create an electron in that state. There is also a factor np(&p), which is the probability that p 
is occupied with an electron. It is assumed this factor is unity if we are trying to calculate the 
properties of an electron in that state. The energy denominator gives the difference between 
the initial energy €, and the energy in the intermediate state €,,4 + @, which has an addi- 
tional phonon. 


p+q 
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Similarly, the interaction term which destroys a phonon is MjCpigoCpo%q- In second 
order it contributes an energy term 


N,l! E Nr(Sp+q)1 


3.248 
Ep — Epig + Oq 


l 2 

V 2 Ma 
Here the factor N, is the matrix element of the phonon destruction operator aqla) = 
Jalna — 1). The electron occupation factor [1 — np] is the same as before. The phonon 
energy in the denominator has changed sign, because there is one less phonon in the inter- 
mediate state. 

These are the only two terms in the Hamiltonian. The summation of these two contri- 
butions, however, does not equal (3.245). In fact, some important processes have been 
omitted. There are two more terms. They arise from the other electrons in the system with the 
same spin state o. These other electrons are also trying to emit and absorb phonons, to alter 
their energy. The other electrons may not, in this process, use the state p as their intermediate 
state. For example, if another electron starts in p’ and scatters to p’ + q, we may not have 
p = p +q. They may not use p because our electron, whose energy we are trying to 
calculate, is occupying it already. The other electrons of the system have a reduced second- 
order self-energy because our electron is in the state p. This energy reduction is associated 
with p. It is assigned to the self-energy of the electron in p, because the system would not 
have it if there were no electron in p. A calculation of the self-energy of an electron in the 
state (p, ©) must calculate how much energy is deprived to the other electrons. Assume they 
start in p + q initially and get scattered into p. In analogy with the two terms above, the self- 
energy of a particle in the state p + q, for emitting or absorbing a phonon and going into p, is 


N,+ 1 N 
q p 


+q Ep7 Oq Ep+q — Ep + Oq 


The factor [1 — np] is the probability that p is empty. This factor is dropped, since p is 
occupied. The factor n-(&,;,4) is the probability that there is an electron in p + q; if there is 
none, then the electron in p is not depriving any electron of energy. The calculation assumes 
that the energy does not depend on spin, so the occupation factors np should not depend on 
spin either. These last two terms are then subtracted from the first two to give the result, from 
second-order perturbation theory, 


Na + 1 — nepra) parea (3.250) 


l 
AE? =- M? 
vq Ep — Ep+q %q = Ep — Eptq Tq 


The Green’s function theory (3.245) changes this result only a little bit. It allows that the 
original electron may have had wave vector p, but its “energy” is ip, + p and not €,. It keeps 
the energy and momentum as separate variables, at least at this stage in the calculation. An 
obvious improvement in the calculation is to make the intermediate state energy not Ep+q but 
some other energy variable which reflects the possible effects of interaction. The difficulty 
with such a procedure is that changes in €,,, also require changes in the occupation prob- 
ability n-(,;,). Later it will be shown that one subset of diagrams, when summed to infinity, 
yield exactly this approximation; the intermediate state energies are treated exactly also. 

In many-particle systems, the interactions which alter energy states of particles usually 
lead to subtle counting problems about where energy has been added or subtracted from the 
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system. The Matsubara method has the virtue that it cranks out the correct answers auto- 
matically. 


3.6. LINKED CLUSTER EXPANSIONS 


Another method for evaluating correlation functions is called the linked cluster method 
or the cumulant expansion (Abrikosov et al., 1963; Brout and Carruthers, 1963). It is the 
method used to evaluate the thermodynamic potential Q. However, the method has also been 
applied to evaluating Green’s functions of time G(p, t) for a few problems, which represents 
an alternative to the usual Dyson equation approach. Both methods—Dyson’s equation and 
linked cluster—would give the same result if all terms were summed exactly. However, in the 
usual case, when only a few terms in a perturbation expansion are evaluated and the rest are 
neglected, Dyson’s equation and the linked cluster method are both approximate solutions. In 
some special cases the linked cluster expansion may be the better approximation. The art of 
choosing which method to apply to a given problem is a matter of judgment, which comes 
from experience. 


3.6.1. Thermodynamic Potential 
The thermodynamic potential is found from the equation 
e 82 — Tr(ePS) = Trfe PX 5(B)] (3.251) 


This quantity is useful to evaluate by itself. After it is found, perhaps only approximately, 
some thermodynamic results are determined by taking its derivatives; for example, if N, is the 
number of electrons 


_ 22 wy =N, (3.252) 
Ou 

oe = (K) = U — uN, =Q4+T7S (3.253) 
aQ 2 p 

mon = (S55) = DM (3.254) 


In the last identity (3.254), the function n, is meant to be the number of electrons with 
momentum p. This quantity was defined earlier, in (3.135), for an interacting system. 
Usually there is a Hamiltonian which is written as 


H=H +V (3.255) 
K = Ko + V = (Ho — uN.) + V (3.256) 


where Ho is exactly solvable. In the interaction representation, the thermodynamic potential is 
given by the right-hand side of (3.251). If there are no interactions and V = 0, then the 
definition of Qy is 


e Po — Tr(e7 BXo) (3.257) 
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For a system of electrons and phonons, 
Ky = oop Clo Cys + X (ala, +4) (3.258) 
po q 


the trace is simple since the noninteracting eigenstates can be used to expand the trace 
eigenstates: 


—(1/2)Bo 
-BQ -BE \2 e a 
Take the logarithm of both sides of this equation, 
BQ, = —2 3 In(1 + eP) + Xin — e Pa) +1 Bog] (3.260) 


The factor of 2 in front of the electron term is the spin degeneracy. The last term on the right 
is the zero-point energy of the phonons. Change the summations to integrals, say for the 
phonons 


d q 
(27) 


ZF) = v] F(q) (3.261) 
q 


and a factor appears of the volume v. The thermodynamic potential is proportional to the 
volume of the system. This result is also apparent from Eqns. (3.252) to (3.254), since the 
right-hand side in each case is proportional to the volume or to the number of particles. 

The goal is to evaluate (3.251) including S(B). The interaction V will add some 
correction terms to Qo which changes it into Q. The method of evaluating this correlation 
function is by the S-matrix expansion given earlier in (3.195): 


po _ ,-po, L CD” f° P 7 ; 
e = e 0 5y — dt cee dt, oT V) V(t,)) 
nao A! Jo 0 


In evaluating the right-hand side, all diagrams are included, whether connected or discon- 
nected, since there is no other series at hand to cancel the disconnected diagrams. The 
disconnected diagrams are necessary for the resummation procedure. Basically, the terms in 
the S-matrix series are examined in order to find some convenient method of resumming 
them. It is efficient to evaluate each different type of diagram only once. The way of doing 
this expansion is the linked cluster, or cumulant, expansion. 

The answer is stated first. The following discussion will try to make the answer appear 
plausible. First, introduce the parameter à. It multiplies the potential V, so that everywhere 
there was a factor of V, or V(t), now write AV or AV (1). The parameter A is actually just unity, 
but it is used to keep track of the number of times the potential occurs in each term in the S 
matrix. The S-matrix expansion in (3.195) may be written as 


(0,6) 
eo BO — eb% O NW, (3.262) 
n=0 


(—1)" 


n! 


n 


B B a A 
| dt,-- | dt, o(T,V(t,)--- V(t,)) (3.263) 
0 0 
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The basic linked cluster theorem is that this series can be resummed into 


e PO — exp( =B + 3 N u;) (3.264) 
[=1 
1 A A 
y= Dp dr,- E (3.265) 


where U, contains just different, connected diagrams: The thermodynamic potential is 
obtained by setting A = 1 in (3.264) 


Q=2,--dY, (3.266) 


The basic theorem is that the thermodynamic potential is the summation of the different, 
connected diagrams. The factor 1// in (3.265) turns out to be quite a nuisance, as is discussed 
later. It makes further summation difficult, except by a trick. 

The theorem will be proved below. First it is helpful to give a simple example. Consider 
the perturbation V to be the earn phone interaction: 


C 


40 (3.267) 


y = > Ty Ag Cha, o 
For this example, only terms with even n occur in the series, since the expectation value of 
(AG) is zero if n is odd. The first nonzero term in the series, after n = 0, is n = 2: 


B 


1 B na 
W, = U, =>] dr, | di, (T, PGDP) (3.268) 


0 
This disconnected diagram is not evaluated here. Its Feynman diagram is shown in Fig. 3.7(a). 
There are two electron lines, and the two vertices are connected by a phonon line. Next 
consider the term with n = 4: 


l 


B B B a a A A 
W, = f ax, | dr, | dr, | di, TP CDEP Ea) 


4! 
All contributions for n = 4 consist of diagrams with four electron lines and two phonon lines. 
Some are shown in Fig. 3.7(d). However, there also occurs the diagrams shown in Fig. 3.7(b) 
where the single bubble occurs twice. In fact, there are three identical terms in W4 which have 
a pair of disconnected bubbles. The factor of 3 occurs because, if one chooses T, as one of the 
vertices of a bubble, the other t variable for the same bubble is either T3, T3, Or T4, and these 
three choices each give the same contribution. This factor is 


B B . - B B A A 
z] dt, | AT o(T,V(t,)V (ty) l dT; l d%q o(T:V (t3)V (t4)) = 3 U3 
0 0 0 0 


D O QS Bee 


(a) (b) s 
FIGURE 3.7 
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Similarly, it can be shown that each of the connected diagrams in Fig. 3.7(d) occurs six times, 
where the factor of 6 comes from different ways to shift the dummy variables around. The 
final result is 


W,=1U2+U, (3.269) 


Next consider terms with n = 6. There will be many connected diagrams which contain six 
electron lines and three phonon lines; try to draw them all. They give Us. There are also 
disconnected diagrams. There will be diagrams containing three bubbles, as shown in Fig. 
3.7(c}—in fact there will be 15 of these, all equal to each other. There will also be discon- 
nected diagrams which contain one four-line term from Fig. 3.7(d) and one from Fig. 3.7(a). 
The total is 


l 
Ws = 3! U, + UU, + Us (3.270) 
The terms in U, appear to be generating the series 
1 1 
1 +U, +5 U? taU + =e (3.271) 


There is a simple argument which shows that the nth term is really appropriate for the 
exponential series. Since n is even, let n = 2m, and there are m bubbles of the type shown in 
Fig. 3.7(a). How many different ways are there to get m bubbles? Take the first variable, t4. It 
may be paired in the bubble, 


jl B B 
U, =-=). Må | dt l dt,D(q, Ti — 7) 
2 qpo 0 0 


x GOP, ti — 9 (p + q; T; — 71) (3.272) 


with the variable t, which is any of the other 2m — 1 variables. Next, take any of the 2m — 2 
variables left after pairing t, and t,, say t,, and it may be paired with any of the 2m — 3 left. 


The number of different arrangements is 


(2m)! 


2mm! 


(2m — 1)(2m — 3)(2m — 5)---5 -3 - 1 = (3.273) 


This factor in W, is 


2m)! 1 | 


B B n ` m 1 
n 7 2mm! (2m)! l dT l dT, TPP) = z U2)" 


0 0 


which is the form proposed in (3.271). The term U, has been shown to sum correctly. It must 

be shown that the other contributions also sum correctly. This proof is not too hard. It is more 

subtle to show that all the cross products such as U, U, occur with just the right counting. 
Another identity is: 


(—1) B p - ` 
U, = i | dt oe | at oTr V(t) n. V (T1)} connected (3.274) 


The same Ų; is in (3.265). Comparing these two expressions shows that for each term / there 
are (J — 1)! contributions alike, so that the prefactor in front of the number of different 
connected diagrams is (/—1)!//! = 1/1. The proof is just an exercise in combinatorial 
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counting. The S matrix is expanded in terms of Green’s functions which yields expressions 
such as 


l 
U, = A dti +: f dG Pi, ti — FG (Py, — T) (3.275) 


The combinatorial question is: how many different combinations are really alike if the dummy 
variables T,, T;, T2, etc., are just relabeled? The answer is (Z — 1)!. The reader should try to 
work some examples to convince oneself that it works. The trivial case is / = 2 so that 
(J — 1)! = 1 was done in (3.272). A more complex example is shown in Fig. 3.8(a), where the 
(4 — 1)! = 6 different terms are shown for one of the different diagrams with / = 4. The 
numbers 1, 2, 3, 4 refer to the t variables: T4, 15,73, 74. These six are the only different 
arrangements. One might ask why the figures are drawn such that t; is always the upper left- 
hand corner. Why not the labeling shown in Fig. 3.8(b)? The answer is that this is exactly the 
same term as the first one in Fig. 3.8(a). They are topologically identical but just drawn upside 
down and backwards, or rotated by 180°. However, and most importantly, both diagrams yield 
identical results when written down with Green’s functions, which is the ultimate test of 
whether they are topologically identical. 

In the prior section, the Green’s function G(p, t) was evaluated by a similar expansion 
technique. There the connected diagrams obeyed the counting rules that 


1 a A A A 
7 o(T:Cpo(T) V(t) e. (tC (0) connected 


= nee T Coo(t) V(t) ue Ê (T) Či s (O)) different connected (3.276) 


Here there are /! arrangements which are identical. In the linked cluster method there are only 
(1 — 1)!. The difference is in G(p, t) there is a reference variable t, so that arrangements are 
considered with respect to this variable. There turn out to be more of them by a factor of /. 

Next consider the evaluation of the thermodynamic potential. It is obtained by evaluating 
the series of terms in (3.266). An example is to evaluate the series of diagrams shown in Fig. 
3.9 for the electron-phonon interaction. They can be summed to give a simple answer. Of 
course, this series is not the exact answer, since there are additional terms in which the 
bubbles may also have internal phonon lines. The rules for constructing diagrams for the 
terms in the thermodynamic potential are similar to those in Sec. 3.5. One follows all those 


ta) O2 Q2 Os Q3 Qs Os (b) 4c 3 
sa as se Da ro; ro 20}! 
FIGURE 3.8 


FIGURE 3.9 
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rules and then multiplies the result by B//. The 1// factor is just the one which occurs in 
(3.265), which has been explained already. The factor B comes from the extra t integration. 
For example, in (3.272), which is the first bubble, the argument is only a function of t} — 7;: 


B B 
l dt, | dt, f(t, — 7%) (3.277) 
0 
Using the Fourier expansion 
l —i —T; . 
FG ~ w= BL em fan) (3.278) 
the first integration gives 
1p, 
5 dreti = 6,9 (3.279) 
0 


while the second just gives B. Each term has this extra t integration, which gives the one extra 
factor of B. For the first bubble in Fig. 3.9 


2 
ET MAD. ig, PCA iy (3.280) 
4519, 


where the polarization diagram P)(q, ig,) is for the single bubble, which was already 
evaluated in (3.214): 


U, = 


2 
POG, idn) = By FP: PIF + q, Pm + idn) (3.281) 
Ps!Pm 

The second term in Fig. 3.9 has two bubbles connected by two phonon lines. Using the rules 
for constructing diagrams gives 


Bx y [M4 D(a, iq,)P™ (q, ign) (3.282) 
4B ai, 

Momentum and frequency conservation requires both phonons to have the same variables 
(q, iq,,), and both polarization diagrams are also just functions of this combination. The term 
with n bubbles and n phonon lines is 


U, = 


BA” 
Un = zp p Oe MaD (A gn) POU, ign)" (3.283) 
q,/9, 
It is simple to sum the series: 
1 œ , 
Q- %=-7 2 Un = T In[1 — A MDO (q, iqn)P (A, ign] (3.284) 
n= Ba, iq, 


The right-hand side of (3.284) is a correction to the thermodynamic potential. It is not exact, 
as explained above, since only some of the diagrams are included. The right-hand side is 
proportional to the volume v, as is obvious when the summation over q is changed to an 
integration as in (3.261). Similarly, the summation over p in P® (q, iq„) can be changed to an 
integration: 


] d 
lo- | p (3.285) 
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The argument of the logarithm is not dependent on v, and v enters only by multiplying the 
result. 

The answer is a logarithm. The summation over ig, needs to be evaluated. This 
summation is inconvenient (but still possible) when the important functions are inside the 
argument of the logarithm. There is a standard trick for eliminating the logarithm, although it 
does not eliminate it but just disguises it. Treat n = A? as a variable of integration, 


K A 
2 
dn ——— = — — 472A 2 
| Ny Inf ] (3.286) 


A = MDO (q, iq„,)P™ (q, ign) (3.287) 


Then for A = 1 


Q -— Q) = 


3 1 M2D pM 
M l dq l q (3.288) 
0 


BT (2n)° n — nM DO PO) 


The logarithm has been eliminated, but another problem is introduced. The right-hand side 
must be evaluated for each value of n and then the integral is taken over n. Since n enters in 
the same way as a coupling constant, this step is called a coupling constant integration. The 
integration can be taken outside of the summation over iq„, which makes this summation 
easier in some cases. 

There are some other manipulations on this formula, which put it into a form which 
appears simpler. First, define the phonon self-energy as 


n(n, q, iga) = NMP” (q, in) (3.289) 


where the coupling constant n is included in the definition. The superscript (1) indicates that 
this expression is an approximate self-energy, which only includes the single-electron bubble. 
The phonon Green’s function is 


D 
D”(n, q, iq,) = (3.290) 


— DO) 


The Green’s function is also a function of the coupling constant strength n. Equation (3.288) 
is rewritten as 


V dq fdn 
Q-Q, =-— | | Ano q, ig,)P(n, q, iq, 3.291 
0 > \Om o n (n, q, ig )D` (N, q, ign) (3.291) 


This expression is an approximation to Q since it employs an approximate self-energy n”. 
There is a theorem which we are going to state but not prove. The exact correction to the 
thermodynamic potential from electron-phonon interactions is 


v dq fdn 
Q- Q, = -— l | on q, ig,)P(n, q, ig, 3.292 
0 BY | On Jon (N, q, iq )D(N, q, iqn) ( ) 


where n(n, q, ig,,) is the exact phonon self-energy and D(n, q, iq,,) is the exact phonon 
Green’s function evaluated with the exact self-energy. Both expressions are found as a 
function of n, and then one has to perform the integration over n. Proofs of this theorem are 
given in Abrikosov et al. (1963). 
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The exact phonon self-energy T(N, q, iq,,) is the sum of an infinite number of self-energy 
diagrams, some of which are shown in Fig. 3.10. There are bubbles with internal phonon lines 
and also several bubbles connected by more than one phonon line. 

Of course, if electron—electron interactions are included, then the phonon self-energy can 
also contain internal Coulomb interactions, etc. The effect of Coulomb interaction on the 
thermodynamic potential should also be included. The general theorem (3.265) and (3.266) is 
true for all interactions: Coulomb, phonon, or others. 

It is possible to combine the effects of the phonons and Coulomb interaction in a simple 
way. Consider electrons interacting by either or both interactions, and then the single-bubble 
diagram appears twice. As shown in Fig. 3.11, the two electron vertices can be connected by a 
single Coulomb line or a single phonon line. The sum of these contributions is 


FIGURE 3.10 


Up =2 E POG, iglo + MD” (a, ign) (3.293) 
qin 


where v, = 4ne?/q°. Similarly, there are four diagrams with two bubbles. The two bubbles 
interact by two Coulomb lines, two phonon lines, or a mix with one of each. This contribution 
is 


2 
n , , 
U, = -7 POG, igava + MaD” (0; ig, Y (3.294) 
q:!4n 


It seems desirable to introduce a combined interaction propagator, Coulomb plus phonon, 
which is 


W(q, iga) = Vy + MEDO (q, ign) (3.295) 
It obeys a Dyson equation, which is 


WG, ign) 


W(q, iq„) = 1 — WO, ig, PO 


(3.296) 


where W(q, igq,,) is the total self-energy diagram, which is the sum of the diagrams shown in 
Fig. 3.9. Of course, now the interaction lines in Fig. 3.9 refer to W), since they may be either 


36 8-80- 


FIGURE 3.11 
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Coulomb or phonon. The generalization of (3.292) to include both Coulomb and phonon 
effects is 


V dq fdn 
a- =-% l | Fr a aW, a ia, 
0 p> | ony Jon (n, q, iq, )W (N, q, ign) 


Vg (3.297) 


The first term contains the Coulomb self-energy of an electron interacting with itself, and this 
unwanted contribution is subtracted out in the second term. The two interactions in (3.295) do 
not complete the possibilities. If the Hamiltonian has additional interactions due to impurities, 
ions, etc., then one has additional contributions which may contribute to the thermodynamic 
potential. 

The expression (3.292) seems to favor the phonon self-energies. It is reasonable to ask 
what happened to the electron self-energies. The answer is that they are in there too. The 
diagrams under consideration can also be viewed as corrections to the electron self-energy. 
This viewpoint does not introduce an additional contribution but is another equivalent way of 
expressing the above answer. For example, the basic bubble diagram, shown in Fig. 3.7(a) and 
expressed in (3.272), may also be written as 


À | | 
Up =F LIMP, ip, JEW, ip,) (3.298) 
Pp» PO 


where 2" (p, ip,) is the electron self-energy from one-phonon processes, which was given 
earlier in (3.212): 


l , , , 
=(p, ip,) = By > MiG (p + q, ipn + idm) DO (q, idm) 
qiam 


We repeat, for emphasis, that the above U, is not a different result than (3.280). It is the same 
result, expressed with different symbols; the same Green’s functions have been grouped 
differently. Similarly, the exact answer is 


l ld 
2-9 =5 E | FaN. p. pO p. ip) (3.299) 
B pip, on 


where ÈŁ(n, p, ip,,) is the exact self-energy of the electron and G(n, p, ip,,) is the exact Green’s 
function obtained from Dyson’s equation with the exact self-energy. If the propagator W in 
(3.295) is used for all internal lines so that the electrons can interact via phonons or Coulomb 
interactions, then (3.299) is exactly equal to (3.297). The two results represent the same set of 
corrections. They are not different contributions but are the same contribution expressed in 
different notation. 

These formulas are often used to calculate the ground state energy in the limit of T — 0. 
There are two limits which are taken: v —> oo and T — 0. In studying the electron gas, Kohn 
and Luttinger (1960) observed that the right answer was obtained if they were taken first in 
the limit of v — oo and then T — 0. The reverse order omits important terms, which are of 
the form 


Bnr (pll — nr (6p) (3.300) 
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If v Æ oo, then all levels are discrete in the finite volume, and as T — 0, these terms give zero 
since either np or [1 — np] is zero. However, if one first takes v — oo, so that the levels are 
continuous, then the limit T — 0 gives 


lim 1 Prr (Ep ILI -_ nr(Sp)] = — lim ~- [- npp) = — (Ép) (3.301) 


Ea 
There are many terms of this kind in the perturbation expansion for the ground state energy of 
the electron gas. They are called dangerous diagrams. They do not threaten your health, but 
they do threaten the accuracy of your answer. 

It is illustrative to show a simple example of evaluating the thermodynamic potential. 
Assume that the only effect of the phonon self-energy is to change the unperturbed 
frequencies o4 to a new set of renormalized frequencies Og. Since the Green’s function is 
20 


= ——___1__.—= — —____ 4 3.302 
(io) 7 oi — 20 qP, i@,) o (iY — % 


20 


This form for D can be accomplished choosing 
a \ 2 2 
2MgT(N, P, 10,,) = NOG — Bq) (3.303) 


The choice (3.303) is not the only possible one to renormalize the frequencies. It is the choice 
one gets by assuming that the change from @, to Q, is accomplished by the one-bubble 
polarization diagram given in (3.289). Then the coupling constant n enters as just a multi- 
plicative factor, as shown in (3.303). If further self-energy diagrams are needed to get a good 
phonon self-energy m, then n would enter in a more complicated fashion. It is shown in 
Chapter 6 that the single-bubble approximation is often adequate, so that the present deri- 
vation applies to many systems. 

Consider the steps necessary to evaluate (3.292), or perhaps just (3.288). The expression 
to be evaluated is 
Qi- o? 

q (3.304) 


1 
Q-Q,=-=— | an — a a 
i Bez (ign) — @% — nQ — 02) 


First, introduce a frequency which depends on a coupling constant: 
2 2 2 2 
Qn = Og + NWO, — 4g) (3.305) 


The summation over Matsubara frequencies may be done by the techniques suggested in Sec. 
3.5. Alternately, one might recognize the answer as 


l 1 


2 


1 
B 2 TA 20, — D y(t = 0) = -za [2n,(Q,) + 1] (3.306) 


20, 
where Dy (t) is the Green’s function which is the Fourier transform of 
2Q,, 


D, (ign) = (ig, -E (3.307) 


The evaluation of (3.304) is now just the coupling constant integral, 


oium 2 fi dn 2 
= 3504-0) | 35 (+35) (3.308) 
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which can be done exactly since (Q? — w”)dn/(2Q,) = dQ,,: 


Q-O2 = JEO +3 In(1 — aa) (3.309) 
q n=0 
1 1 1 — e Bq 
=F]. 00 +5 (om) SA0 


The right-hand side is the thermodynamic potential from the phonons at the new frequency 
Q, minus the contribution from the phonons at the old frequencies ©œq. When this result is 
combined with (3.260) for Qo, the terms with œq all cancel. The final answer is 


_ -2f d'p 
= «BS ny 


+ z| dq E BQ, + In(l — e7F%a))] (3.311) 
B (2 T) 2 q 

The thermodynamic potential is just the summation of the unperturbed electron contribution 
plus the contribution from the phonons at the new frequencies Q,. Of course, this form is 
expected from simple considerations. If the only effect of the electron—phonon interactions is 
to change the phonon frequencies to the new values Q,, then one must be able to solve the 
Hamiltonian exactly and write it as 


H = Ð Cio Cpe + LO gahag (3.312) 
po q 


In(1 + e7 Pr) 


The thermodynamic potential for this simple Hamiltonian is just (3.311). Unfortunately, most 
corrections to the thermodynamic potential are not as easy to evaluate as this simple example. 


3.6.2. Green’s Functions 


So far the linked cluster, or cumulant, expansion has only been used to evaluate the 
thermodynamic potential. One could use similar expansion techniques to evaluate other types 
of correlation functions, such as Green’s functions (Brout and Carruthers, 1963). This method 
has been used with success on two problems. One is the polaron problem for a single particle 
in a band, studied by Dunn (1975). The other has been in treating X-ray processes, where the 
hole dynamics have been studied by a linked cluster method (Mahan, 1975). Here the hole 
has also been treated as a single-hole problem. It has not been applied to a many-polaron, or 
many-hole, problem. The procedure begins with the S-matrix expansion for a particle Green’s 
function, given in (3.196): 


gp.) = È MWp, T) 


(—1)" p B x A A a 
W,(p, Tt) = — n! | at, C | AT (TCyg(t)V (11) oe V(t,)Cy5(0)) (3.313) 


where the coupling parameter À has been added to the expansion. In the cumulant expansion, 
it is assumed that the series can be regrouped as an exponential power series in À: 


gp, 2) = 9p, exp] È wF, (3.314) 


n=1 
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The functions F,(p, t) are found by equating terms with like powers of à in the series 
expansions. That is, write 


] 
J= 


2! 
l 
+A (F, + FF) + af] o. (3.315) 
so that, term by term, we have the equalities 
W, = G®F, (3.316) 
1 
W, = g® (r; + Ta (3.317) 
1 
W, = g® (2 + FF, + T (3.318) 


The functions W, (p, t) are scalar functions which are calculated using Wick’s theorems. The 
unknown functions F,(p, T) can be evaluated in terms of W, by solving the above equations 
(Brout and Carruthers, 1963): 


W 

F, = 0 (3.319) 
W, l 2 
W. l 

F, = ro) — FF, — xfi (3.321) 


The basic idea is very simple. It is just a different way of regrouping the series which results 
from the S matrix. It provides an alternative to using Dyson equations. Presumably, one gets 
the exact answer if all terms are taken and evaluated. However, since one can only get the 
exact answer in trivial problems, the method is really useful in problems where one can 
calculate a few terms in the perturbation expansion. 

There are two possible difficulties with this procedure. The first is that the regrouping 
may not make mathematical sense. Nonsensical results are found in cases where G(p, 1) 
describes a Green’s function of a particle in a system with N, like particles, where N, œ v. In 
that case the terms F,, do not converge when put into the exponent. The offending terms are 
those which contribute to the total energy of the system and hence change the chemical 
potential. These energy terms are of order O(v). The chemical potential is not changed by V 
in one-particle or one-hole problems, since the one particle does not contribute any energy 
term proportional to v. The series in F,,(p, t) shows excellent convergence in these cases. 
These results will be discussed in Chapters 7 and 9. The other problem with this method 1s 
that, when it works, one has some fairly complicated functions of t in the exponent. One is 
invariably forced to use a computer to perform the Fourier transforms in frequency space. 
This step is not really a difficulty—yjust some work! 

The physical model implied by this regrouping will be discussed in Chapter 7. It is a 
systematic development of the Tomonaga (1947) model of pion emission in particle theory. 
Tomonaga assumed that all pion emissions from a particle were statistically independent— 
there was no correlation between successive emissions. His model corresponds to the 
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approximation of just using F} alone. However, if one uses F} + F, then F, puts in the 
correlation between pairs of emissions. Similarly, F, puts in correlations between three 
particle events, etc. The advantage of the method is that the Green’s function includes many- 
particle emission processes, even when one keeps only a few terms in the series for F,. Solid 
state problems do not have pions but other boson excitations such as phonons or the boson- 
like excitations of the electron gas. 


3.7. REAL-TIME GREEN’S FUNCTIONS 


Six different Green’s function of time were introduced in Chapter 2. Two of them, the 
retarded and advanced functions, have been discussed at nonzero temperature. The other four 
are discussed here. The usefulness of the real-time functions is in the treatment of non- 
equilibrium phenomena. Nonequilibrium transport theory using real-time Green’s function is 
treated in Chapter 8. The present section discusses their properties at nonzero temperature. 

The Matsubara method is unsuitable for nonequilibrium since there is no thermo- 
dynamic basis for temperature for a system out of equilibrium. The entire Matsubara method 
is based upon temperature, and no method has been found so far for extending it to non- 
equilibrium processes. 

The real-time functions at nonzero temperature have formal definitions very similar to 
those at zero temperature. Comparing with (2.144), the Green’s functions for electrons of 
momentum p are 


G*(p, t, h) = i(Cho(t)Cpo(t)) 

G” (P, ti, t) = —i(Cyg(t,) Cho (ty)) 

GAP, h, b) = OG —H)G(P, ti, h) + Oh — 4), ti, t) 

GAP, ty, h) = Of — 4)" O, 4, b) + OG — h)G*@, th, b) (3.322) 


These definitions appear to be identical to those at zero temperature. The important difference 
at nonzero temperatures is that the brackets (---) have a different meaning. They have several 
different meanings, depending upon the circumstance: 


Case 1: At zero temperature, and in equilibrium, the brackets (---) denote the ground 
state of the interacting system. 

Case 2: At nonzero temperature, and in equilibrium, the brackets denote the thermo- 
dynamic average as given in (3.25) or (3.82). 

Case 3: When not in equilibrium, the brackets denote an average over the accessible 
phase space. However, the available phase space depends upon the recent 
history of the system, and kinetic constraints such as energy conservation. The 
meaning of the brackets is poorly understood for systems out of equilibrium. 


Another feature of (3.322) is that the Green’s functions are not expressed on the left as 
functions of (t; — t) but of (¢,, t,) separately. In equilibrium, they are functions of (ti — b), 
but that may not be true for systems out of equilibrium. 

In thermal equilibrium, the real-time functions each have a simple relation to the retarded 
function. Each is found easily from G,- Since G, is found easily from the Matsubara 
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function, the easiest way to find the real-time functions is by first finding the Matsubara 
function of complex frequency. These statements apply only in thermal equilibrium. 

In thermal equilibrium the following derivation establishes the relationships among the 
real-time Green’s functions. Using the states |n) and |m}, which are exact eigenstates of the 
Hamiltonian, two of the real-time functions in (3.322) can be written as 


GA (Pp, ty — ty) = iX ee | (n|Cpolm) Per Fe) (3.323) 
G? (p, ty — b) = —i P| (nC g |) | e" nF) (3.324) 


The time dependence is Fourier transformed to œ, which produces the delta function 
275(o + E, — Em) in each term. These terms have many of the same factors that occur in the 
spectral function in (3.120). They differ mainly in the thermal factors exp(—BE,, m). They can 
be made to be identical by utilizing some algebraic relationships such as 


e Fen S(m + E, — Ep) = e71 + e.ll + E, — Em) 
= [e Fr + eP] .(w)8(@ + E, — Em) 
In the first line the factor of np (œ) has been added to both numerator and denominator. In the 
second line, one of these factors has been changed to the sum of two exponential factors by 
using E„ — œ = E,. The part in brackets is exactly the combination that appears in the 
spectral function: 


G“ (p, œ) = in-(@)A(p, œ) (3.325) 


The other real-time functions can be derived using the same technique: 


G7 (p, ©) = —i[1 — nr (0) AE, o) (3.326) 
G,(p, @) = [1 ~~ Nr(O)|Gret(P, o) + np(@)Gaay(P, o) (3.327) 
GP, o) = -[1 = N-(@)|Gaay(P, o) ~~ Np ()Gyee(D, o) (3.328) 


These expressions are identical to (2.160). In equilibrium the real-time Green’s functions are 
found easily from the retarded and advanced functions. The latter two are found easily from 
the Matsubara functions. In equilibrium there is no need to set up a separate formalism for the 
real-time functions. 

The primary usefulness of the real-time Green’s functions is in the theory of non- 
equilibrium phenomena. Some of these ideas and techniques are introduced here. The first 
step is to derive Dyson’s equation. In fact the result (2.157) at zero temperature is also correct 
for nonzero temperature. The matrices have the definitions that are given in (2.156). There is 
no reason to rederive the same equation. Instead it seems appropriate to comment on the 
differences in the derivation between zero and nonzero temperatures. 

Equation (2.157) is generally regarded as being the correct form for Dyson’s equation, 
even for systems out of equilibrium. However, we have never seen a satisfactory proof for 
nonequilibrium systems. For systems in equilibrium, one can derive (2.157) at nonzero 
temperature by starting from Dyson’s equation for the Matsubara Green’s functions. Treating 
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T as a complex variable, one can deform the contour of integration, and end up with (2.157 
(Langreth, 1976). : 

For nonequilibrium, the Matsubara functions are obviously an invalid starting point. 
Instead, one can try to expand the S matrix in real time. Then one encounters the problems 
mentioned in Sec. 3.1 regarding expanding two S matrices: one for exp(—BH) and another 
one for exp(+itH). All derivations ignore the former. Including only the S matrix of time, one 
again finds (2.157). It appears correct to omit the S-matrix expansion of exp(—BH). This 
point is discussed in the next section. Equation (2.157) is used for Dyson’s equation for 
nonzero temperature, for equilibrium and nonequilibrium, since there is nothing else avail- 
able. 

Nonequilibrium theory usually proceeds by first deriving an equation of motion for the 
Green’s function, which is similar to a Boltzmann equation. This equation is usually derived 
for systems in equilibrium, or slightly out of equilibrium. Then the equation is applied to 
systems far from equilibrium. 

The first step is to find an equation of motion for the mteracting Green’s function when it 
is not in equilibrium. Such an equation can be found from (2.157) by operating by 
(id/dt — H) on both sides of the equation. First one needs to know the time evolution of the 
noninteracting Green’s functions. This behavior is deduced easily from (2.144) and (2.148). 
The time derivatives can be expressed compactly in the matrix notation. Remember that 
d@(t)/dt = d(t). 


(2 — x) Gy(k, t) = (N1 (3.329) 
(a — m) Gy(x) = (x) (3.330) 


On the right-hand side of (2.157), this operator acts only upon Gy for which the above result 
is used to find 


(à — Hy) G(x), x2) = f(x; — x) + | deste ,X3)G(X3, x2) (3.331) 


This formula provides the equation of motion for the interacting Green’s function. It is the 
basis for the nonequilibrium theory of interacting systems. 

The structure of this equation is interesting. On the left-hand side, only the noninter- 
acting terms are contained in the Hamiltonian Hy. The contribution from the interactions is 
provided by the self-energy functions on the right-hand side. g 

So far, the equation of motion has been derived for the variable x, in G(x,,x,). It is 
useful to have an equation of motion for the other variable x,. Since the definition (2.144) of 
the Green’s functions contains the conjugate wave function yi (x,), the equation of motion on 
this variable is the complex conjugate of Schrédinger’s equation. Furthermore, Dyson’s 
equation can be written in an alternate form. Instead of (2.157), an alternate form is to have 
Go on the right in the interaction terms: 


G(x, x2) = Gox: — x2) + |a | iča x), X4)Go(X4 — x2) 
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The equation of motion on the x, variable, when using this form for Dyson’s equation, still 
acts only upon Gy on the right and produces delta functions. These steps produce the alternate 
equation of motion for the Green’s function: 


a - - 
(-i5, — Hy(r2, -p»)) G(x}, x2) = Sx, — X_)f 
+ | dx G(x, ; x3)E(x, ; Xo) (3.332) 


The sign change on p, comes from the fact that the left-hand side of the above equation is the 
complex conjugate of Schrédinger’s equation. The operator p = —iV changes sign under 
complex conjugation. This behavior is different from the Hermitian conjugate, where Ho acts 
to the left on yt (x) and p does not change sign. These two equations of motion will be used 
in Chapter 8 to develop a quantum Boltzmann equation for nonequilibrium phenomena. The 
quantum Boltzmann equation is based upon a many-body distribution function first suggested 
by Wigner (1932). 


3.7.1. Wigner Distribution Function 


The traditional Boltzmann equation is expressed in terms of the distribution function 
f (r, v, t). The three variables are position, r, velocity, v, and time, t. The point of view is 
semiclassical since it is assumed that the position and velocity (momentum) of the particle 
can be defined simultaneously. In order to use this distribution for quantum systems, it is 
necessary to perform some type of averaging in order to remove effects due to the uncertainty 
principle. 

If quantum effects are important, it is necessary to introduce another variable into the 
distribution function. This variable could either be energy E or equivalently a frequency 
œ = E. The resulting distribution function is often called a Wigner distribution function 
f (k, œ; r, £). The velocity v has been changed to the wave vector k = mv. The distribution 
function is derived from the Green’s function G< (x4, x,) defined in Eq. (2.144). The Wigner 
distribution function (WDF) is derived using the following series of steps. First go to a center- 
of-mass coordinate system: 


(R, T) =4(x, + xy) (3.333) 
(r,t) =x, -x (3.334) 


Note that 7 means center-of-mass time, rather than temperature or time-ordering. The symbol 
B is used for the inverse of Boltzmann’s constant times temperature. The notation on the 
Green’s function is altered to these center-of-mass coordinates: 


G<(r, t; R, T) = idp'(R — ir, T -AR +4 r, T +40) (3.335) 


The next step is to Fourier transform the relative variables (r, t) into (k, œ): 


G<(k, œ; R, T) = fre | dte’™ G<(r, t; R, T) (3.336) 
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The relation to the WDF is quite simple: 
f(k, œ; R, T) = —iG*(k, œ; R, T) (3.337) 


Regard this assertion as the definition of f(k, œ; R, T). This choice is made reasonable by 
showing that various moments of f provide the macroscopic quantities of particle density 
n(R, T), particle current j (R, T), and energy density ng(R, T): 


Pk (© d 
n(R, T) = aa 2 fik, o; R, T) = (WIR, TVR, T) 
dP 00 
7(R, T) = l E | x fik, ©; R, T) (3.338) 
Bk [* 
ng(R, T) = jem f 2 of tk œ; R, T) 
— É a(R, T- IAWR, T + D 
t=0 
= (VİR, T)HW(R, T)) (3.339) 


The technique for solving nonequilibrium problems is very simple. The equation of motion in 
(3.331) or (3.332) for G<(k, œ; R, T) is just the quantum Boltzmann equation (QBE). This 
equation is then solved, which yields directly the Wigner distribution function f (k, œ; R, T). 
Macroscopic variables such as current and density are found by taking the above integrals 
over d?k and dw. This technique is also useful in systems that are not homogeneous because 
there is a slowly varying potential V(R). 

The above expressions are useful for nonequilibrium situations. For example, the answer 
is zero if the equilibrium function (3.325) for G~ is used to calculate the current j, in (3.338). 
That is the right answer. A system carrying current can be in steady state but not in equili- 
brium. In order to obtain a nonzero current it is necessary to solve for the nonequilibrium 
Green’s function G< when the Hamiltonian contains an electric field. This procedure is 
described in Chapter 8. 

The semiclassical Boltzmann distribution function f(R, v, T) is found by taking the 
frequency integral of the WDF: 


OO 


F(R, v, T) = l S finv, ©; R, T) (3.340) 


~O 


This method of deriving the semiclassical Boltzmann equation is an alternative to the usual 
technique of coarse grain averaging. 

The Green’s function G< plays a central role in the nonequilibrium theory. In order to 
solve it one also needs to know the retarded function. 
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These two Green’s functions are the most important. The matrix equation in (3.331) and 
(3.332) is untangled to present the individual equation of motion for these separate Green’s 
functions. 


E — Hats) G*(x1,X) = | desIB (xy 25), X2) 
1 
— È< (x1, x3)G7(X3, x2)] 
ise — m| Gret(X1,X2) = St — Xp) 
1 


+ | desl) Gets.) (3.341) 


-2 — Hoea) |0 = | dsla, E s, x2) 
2 


— G< (x1, X3)27(%3, x2)] 
_ oO x 
|- — H(x2) |e , X2) = S*(x, — X2) 
2 
+ | dx: Gret(X ’ X3 )Zret (X3 ’ Xp) (3.342) 


The equations are the starting point for the derivation of the quantum Boltzmann equation. 
The WDF is an elegant and useful formalism. It does have some liabilities. If one 
constructs the WDF for any nontrivial Hamiltonian, then f(r, œ; R, T) has regions where it is 
negative. The function is not positive definite, which means that it cannot be interpreted as a 
probability density. This feature can be shown by a simple example. Consider in one 
dimension a particle in a box of length L. The eigenvalues and eigenfunctions are 


2 
, (x) = 2 sin(k,x), £, = en? (3.343) 
2 
nv TU 
k, = T , E= mL (3.344) 
Using the representation in (2.147), and setting t = t, — h, gives 
2i , 
G< (x4, x>, t) = Z S np(€,) sin(k,x) sin(k,x)e (3.345) 


Here xı, are not four-vectors but one-dimensional coordinates of position. The product of 
two sine functions can be written as the difference of two cosine functions. Taking the Fourier 
transform of time gives the function of frequency. Since the system is in equilibrium, write 
G< = in,;(o)A. In the center-of-mass notation 


F(x, 0; X) = np(@)A(x, ©; X) (3.346) 


A(x, ©; X) = a YIcos(k,x) — cos(2k,X)|5(@ — en?) (3.347) 
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where x = x, — x is relative position and X = (x, + x2)/2 is center-of-mass position. The 
function A(x, @; X) is a sum of delta functions. The amplitudes of the delta functions have 
variable sign, depending upon the values of x and X. Neither A nor f are positive at all points. 

For many Hamiltonians of interest the function f is not positive definite. That creates 
philosophical problems if f is interpreted as a probability function. Do not interpret f this 
way. Instead, f is regarded as one step in the calculation. It is never the last step, since it is 
used to calculate other quantities that can be measured. Physically sensible results are always 
found for measurable quantities such as the particle density and current. No problems are 
encountered as long as one avoids interpreting f as a probability density. 


3.8. KUBO FORMULA FOR ELECTRICAL CONDUCTIVITY 


Many experiments in condensed matter physics measure the linear response to an 
external perturbation. The experimentalists may put the sample in a magnetic field, electric 
field, optical field, temperature gradient, or pressure field and measure the magnetization, 
electrical current, light absorption, or whatever. Linear response means that the signal is 
directly proportional to the intensity of the external perturbation. Usually the assumption of 
linear response is valid at low magnitude of perturbing field. 

Kubo formulas are the name applied to the correlation function which describes the 
linear response. There are many of them, since there are many possible perturbations and 
many linear responses for each perturbation. Formulas of this type were first proposed by 
Green (1952, 1954) for transport in liquids. Kubo (1957, 1959) first derived the equations for 
electrical conductivity in solids and in the form which will be used here. His derivation and 
result will be given below. Other Kubo formulas will be derived in this section. In fact, several 
derivations are given of the formula for the electrical conductivity in order to familiarize the 
reader with different types of derivations in the literature. One finds formulas which appear to 
be quite different but are really identical. It is also demonstrated that the longitudinal and 
transverse conductivities are identical at long wavelength. 

In electrical conduction, a time-dependent external electric field, 


E&Y (yr, t) — mle) eat —iot (3.3 48) 


is applied to the solid, where « = x, y,z are the directions in space. In linear response, the 
induced current is proportional to the applied electric field 


J (r, t) = 2 015 (q, o) 2E. earo (3.349) 


Although the symbol o’ has the appearance and dimensions of a conductivity, it is not the 
appropriate conductivity. Instead, the conductivity is needed which is the response to the total 
electric field in the solid. The applied or external field EC” induces currents which in turn 
make other electric fields. The summation of all these fields is the total electric field, which is 
called Z,(r, t). The conductivity o is the one which responds to the actual electric field in the 
solid: 


J,(r, t) = } ` Opg(q, ©)Ep(r, £) (3.350) 
B 


E,(r, t) = Z, exp[i(q -r— op] (3.351) 
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Take Eq. (3.350) as the fundamental definition of the microscopic conductivity. It also 
introduces the coordinated problem of determining the proportionality between the external 
electric field E*» and the total internal electric field E, (r, t). For a static, longitudinal electric 
field, the two electric fields are related by the continuity of the normal component of D at the 
surface. A similar relation for transverse fields is derived below. Equation (3.350) is correct 
for a homogeneous material. Actual solids are not homogeneous, although crystals are 
periodic. However, (3.350) implies the spacetime response may be written as 


t 
J (r, = jer | dtS plr — r’, t — tE, t) (3.352) 


where repeated indices imply summation. This equation is the Fourier transform of (3.350). It 
assumes that the current response of a material at r is only a function of the separation (r — r’) 
from the external electric field at r’. This assumption is incorrect on an atomic scale. It is 
certainly not valid in the atomic-like core states of a solid or in any case where the separation 
(r — r’) is a few angstroms. A rigorous formulation would have the conductivity as a function 
ofr and r’ separately, o(r, r’; t — 7’). In solids, it is permissible to use (3.350) only when it is 
understood that the current is to be averaged over many unit cells of the solid. Usually it is 
applied when q is small and long-wavelength excitations are being studied. 

Quite often our interest is in the dc conductivity, which is obtained by taking the limits 
q — 0 and œ — 0 in that order. Then the conductivity is only real. The Kubo formula is 
derived assuming that only a single frequency is perturbing the system and that 0,,(q, œ) is 
the response to this single frequency. Actually, using a single frequency is not a restriction. It 
is assumed that the system is linear and perturbations at different frequencies act indepen- 
dently. The total current is then the summation of the responses at different frequencies. 

The Hamiltonian for the system is taken to have the form 


H+H’ (3.353) 


The term H’ contains the interaction between the total electric field and the particles of the 
system. Equation (2.161) is used as the basic form of the interaction between electromagnetic 
fields and charges. The electric field is expressed as a vector potential, so that 


H' = — - l Prj (A,r, t) (3.354) 
1 —i 
a al t)= y EAr, t) (3.355) 


Select the Coulomb gauge, where V - A = 0, as explained in Sec. 2.10. Also, the electric and 
vector potentials are taken to be transverse, so the scalar potential is set equal to zero. The 
terms in (2.161) with A? are dropped, since their effects are nonlinear in the electric field. The 
term H contains all the other terms and interactions in the solid or liquid. There are inter- 
actions such as electron—electron, electron-phonon, spin-spin, with impurities, etc. The goal 
is to calculate the electrical conductivity when all these other interactions are present. 

The current operator in (3.354) was discussed in Chapter 1. It has the form of 


Jnl) = 5-5. elpu — r) + òl — Pil (3.356) 
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The notation is more compact if the dr integral is evaluated in (3.354), which gives H’ in 
terms of the Fourier transform of the current operator: 


H' = =j (02e (3.357) 


. l ig-r; iq, 
Jaq) = 5 2 eilPae” i+ ea Dig] (3.358) 


In terms of creation and destruction operators, the current operator is conventionally written 
as 


jq) = Lpr” CC (3.359) 


where À, ò are the states associated with some unperturbed Hamiltonian Họ which is chosen 
as the basis for the perturbation expansion. Several of these possibilities were discussed in 
Chapter 1. A distinction is made between the current operator j, in (3.358) and the induced 
current J, in (3.350). The operator j, is used in the Hamiltonian, while J, is the actual current 
measured by the experimentalist. The measured value of the current is the average value for 
the velocity of the particles in the system, which is taken as the summation over all the 
particle velocities divided by the volume: 


l 


Jar, t) = Ar Vig O(T _ r) = ET (Va) (3.360) 


When quantizing the particle velocity in (1.410), the velocity is the momentum minus the 
vector potential: 


y=} |p; —<A(r) | (3.361) 
m C 
e e 

Je = EPn) EAr) (3.362) 


The momentum operator p; is proportional to the current operator, j = ep;/m. The last term 
uses the relationship (3.354) between the vector potential and the electric field. For this term, 
the summation over particles divided by volume is replaced by the density, which is appro- 
priate for a long-wavelength disturbance: 


2 
J,(r, t) = (j,(r, t) + i E (r, t) (3.363) 


The second term in the current is proportional to the electric field and the first term given by 
the expectation value of the local current operator. The latter term is also proportional to the 
electric field. Later it is shown that the constant of proportionality is given by the Kubo 
formula. 

These two terms are called J and J®: 


J= IM 4° (3.364) 
: 2 
yO = -0 E(r, f) (3.365) 
m 
J® = (j(r, 0) (3.366) 


The next step is to derive J”, which is the derivation of the Kubo formula. 
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3.8.1. Transverse Fields, Zero Temperature 


The following derivation of the Kubo formula is valid at zero temperature. The first step 
is to consider the expectation value of the current operator as a function of time: 


JM, 1) = WV eh Y eT’) (3.367) 


The Heisenberg representation is used, as explained in Sec. 2.2. Next go to the interaction 
representation, where H’ is treated as the perturbation. 


e`H+H Y — eH U(f) (3.368) 
U(t) = e" e tata (3.369) 
ID (x, t) = (VIUDO FUON) (3.370) 
The operator U(t) was defined earlier in (2.14), and a formal solution was derived in (2.27): 
t 
U(t) = Texp| =| a'H'@)| (3.371) 
0 
where the usual definitions are 
H'(t) = el He 4 (3.372) 
jO = P je ™ ete. (3.373) 


The wave function |\’) in (3.367) is the Schrödinger wave function at t = 0 for an interacting 
system with both H + H’ as the Hamiltonian. As discussed in Sec. 3.1, the wave function |\) 
is appropriate when H’ is absent from the system. The relationship between the two wave 
functions is [see (2.36)] 


IW) =T exp| -i f dt' H’ ow (3.374) 

The time development of the system is given by combining these results: 
U(t)|\W) = T exp i] aO) (3.375) 
= S(t, —œ)| y) (3.376) 


The expectation value of the current is now 
J = (WIST(t, —co)j,(r, HSE, —00)|) (3.377) 


An exact solution of this equation is not needed for the derivation of the Kubo formula. Only 
terms linear in the electric field E, are desired, which requires us to only keep terms linear in 
H’. It is sufficient, for linear response, to keep the terms 


t 


S(t, —0o)|) = Í — i| aH) hy) + O'Y (3.378) 
The Hermitian conjugate of this wave function is 
(VIST (t, —00) = wif + i| HHO) + O(H’y (3.379) 


—0O 
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From (3.377), the expectation value of the current operator is 
t 


JO(r, t) = E + if aH’) |j at — i| aH'@)| Ny) 


= (Y| E; t) ~ | dt'| j,(r, t)H’(t’) ~~ A(t yj(r, al IW) 


The first term is assumed to vanish, 


(War, Ol) = 0 (3.380) 


since there is usually no current in the solid in the absence of an electric field or something 
equivalent such as a time-varying magnetic field. The first nonzero term is the one linear in 
H’, which is the important contribution. There are two terms, which can be expressed as a 
commutator: 


Jr, t) = -i| dt Olla, À, EEO) (3.381) 


The goal is to derive Eq. (3.350). A factor of E,(r’, r) must be removed from the integrand. 
With H’ given in (3.358), the integrand has the factors 


[alt D, HO] = ~ Bye ljal, À, jela 0 


= —Ep(t, eT OO j (r, t), jg(q, t’)] (3.382) 


which are regrouped in the following way: 


l —iq-r /_ i@(t— . : 
JP == Eplr, Ne | dt Lj, Ò, jela, PIN) 


A factor of exp[i(q - r — œt)] was taken outside of the integral and, to compensate, the inverse 


of this factor was left inside the integral. Comparing this result with (3.350) shows that they 


have the same form. The current J,” is proportional to the electric field. The constant of 


proportionality is the conductivity: 
l arf io(t—t , , 
opo eA ye, Dja CIN) 
—0O 


+i 8, (3.383) 


However, this result is not quite right. The final step is to average over the space variable r in 
order to eliminate atomic fluctuations. Take this average by integrating over all volume d?r 
and then divide by v. The only r dependence of the expression are the factors 


| Breat, t) = j,(—q, t) = jį(q, t) (3.384) 


The final result for the conductivity is 


1’ | 2 
Saplaq 0) => | EXI LNA, Djela CI) +7 Sp 68-385) 


Equation (3.385) is the Kubo formula. 
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The correlation function in (3.385) is only a function of the time difference t — r’. If this 
difference is made the variable of time integration t — t’ — t, the formula can be expressed as 


ore) e? 
opo) = ea Dla ON) +E Bap 


which is the way it is usually written. The right-hand side is definitely not a function of t. 
The wave function |W) in (3.385) is the ground state of the many-body Hamiltonian H. 
This Hamiltonian contains all the possible interactions in the solid except the interaction with 
the vector potential H’. The conductivity is calculated using (3.385), which has no mention of 
photon field. The conductivity is an intrinsic property of the ground state of the system. 
Equation (3.350) can be viewed as a Taylor series in the applied electric field = = gr 


J,(Eg) = J,(0) + (ze ja + o$? (3.386) 
=p 


The conductivity 6,3 = (3J /3Eş) is calculated for =) _ 0, It is a characteristic of all linear 


response correlation functions that they are ground state properties. A major difficulty is that 
the ground state is not known for most many-body systems. Then one must evaluate the 
correlation function using the S-matrix expansion and Green’s function analysis, which has 
been described in Chapter 2 and in this chapter. Equation (3.385) is often the starting point for 
a many-body calculation, as shall be demonstrated in later chapters. 

The Kubo formulas contain a retarded, two-particle correlation function. From the 
definition of the retarded Green’s function in (3.88), the retarded correlation function of the 
current operator is defined as 


Tela t= A) = = Ol- “WIEN, Dja DIN (3.387) 
Its Fourier transform is 
Mgao = -i| dedo- ALa, Dja N 


Comparing this definition with (3.338), we find that 


o.p(q, ©) = = [sa @) + E a (3.388) 


The conductivity is the retarded correlation function of the current multiplied by i and divided 
by œ. The correlation function I1,g(q, œ) is usually called the current—current correlation 
function. 

It is usually easiest to calculate the retarded correlation function in the Matsubara 
formalism. The procedure follows the prescription outlined in (3.105) and (3.110). First 
define the equivalent current—current correlation function in the Matsubara formalism: 


l! 
Tp (a, 1) = — > (T, j4(a, yla, 0) (3.389) 


p , 
1,9(q, i@,) = | dre™ TI plq, 1) (3.390) 
0 
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The Matsubara function is evaluated as best one can, perhaps by using the diagrammatic 
techniques described in Sec. 3.4. Then the desired retarded function is obtained from 


change 


II j 
iO, > © + ið «p(G, 10,) —> T1,9(q, ©) (3.391) 


The conductivity is found from (3.388). These steps are always used in calculating the 
conductivity. 

The dc conductivity is obtained by taking the limit q — 0 and then the limit œ — 0. The 
wrong answer may be obtained if the order of these limits is reversed, which may be 
understood on physical grounds. The limit œ = 0, q Æ 0 describes a static electric field, 
which is periodic in space. Here the charge will seek a new equilibrium, after which no 
current will flow. It is usually important to first take the limit of q — 0: 


0.p(4; o) = O.p(@) 
II j = II, 
1) Melao) = Tg) 


Ju(Q, T) = Jy) 


The limit q — 0 presents no problem. The current operator is well behaved in this limit, so 
that the correlation functions are well behaved. In fact, when only the dc conductivity 1s 
needed, it simplifies the derivation to set q = 0 at the beginning of the calculation. 

The limit of œ — 0 is more delicate. Here the conductivity is real: 


oi 
Reo,g = — lim z ml pCO)! (3.393) 


The right-hand side contains the imaginary part of the retarded correlation function. The 
combination —2Im[I1,,(@)] is just the spectral function of that operator; call it R,g(@). By 
using the general formula (3.117) for the spectral function, a formal solution for ImIT is 


—Im[I1,.()] = 5R,g(@) 


Too E | | 
== (1 — 2 PeyeP® Y eE (n) jh lm) ral gln) 


x 6(@ + En — Em) (3.394) 
Now it is straightforward to take the limit œ — 0, since the prefactor of the above equation is 
1 
im —(1 — eP®) = 3.395 
lim (i —e"") = B (3.395) 
and 
Re(,g) = mF ein ye PP (n| jilm) (ml jgln)5(E, — Em) (3.396) 


The conductivity is finite as œ — 0. There is no divergence in this limit, even though there is 
the œ~! factor in (3.393). 

It is a curious feature of the Kubo formula that, in order to calculate the dc conductivity, 
it is necessary to calculate the ac conductivity and then to take the limit œ — 0. At least this is 
the easiest way to do it. Later another Kubo formula will be derived strictly at œ = 0. 
However, the latter formula is cumbersome to evaluate, and it is faster to evaluate o(@ = 0) 
by the procedure of taking the ac conductivity and the limit that œ — 0. It is also easier, at 
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zero temperature, to use the Matsubara formalism and then to take the limit of T — 0 at the 
end. Below it is shown that (3.385) is the nght Kubo formula at nonzero temperatures. 

The current-current correlation function is a two-particle correlation function. The 
current operator contains a product of one creation and one destruction operator, so the 
correlation function (3.390) contains at least four such operators: 


l ; 
Mela D= -7 3 PCDP” CANT. ECCO) 
„BV 


The correlation function describes how two particles are created and destroyed. The 
conductivity arises from correlations between these two events. 

The conductivity is a measurable quantity. It is the same quantity as measured in a circuit 
or by an optical probe. Measurable quantities always involve retarded correlation functions of 
at least two particles. The one-particle Green’s function can never be measured—at least not 
in the rigorous sense. The one-particle Green’s function describes a Gedanken experiment 
which can never happen in practice. One creates a particle at time 1’, 


G(p, T— T’) = —(T,Cpo (C$ (T) (3.397) 


in state po and then destroys it at another time t. One asks for the correlation between these 
events. This experiment is impossible, since real particles cannot be created or destroyed. In a 
two-particle Green’s function, one describes a sequence of events which is realistic. A particle 


(T,Ch(t)C,(t)C}(0)C,(0)) (3.398) 


is changed from one state to another, say from v to p, and then at a later time it is changed 
back. The two-particle correlation function, wherein a particle changes its state, is always 
found in the correlation functions of linear response. In elementary particle physics, a particle 
can be absolutely destroyed or created. But this event always happens in conjunction with 
some other event which involves other particles. For example, an electron and positron can 
mutually annihilate and make several photons. Then one would have terms in the current 
operator involving the creation or destruction of two particles: 


jul) = Yt pt?’C,ds + hc] (3.399) 


Even in this case, the current—current correlation function involves four operators: two for one 
particle and two for another. If the particles interact, i.e., by a Coulomb interaction, then the 
correlation function may not be divided into two independent Green’s functions: 


(T.C, (t)d3(t)d{(0)Cl(0)) + (TC, (1) Ch (0)) (T,d5(t)d,(0)) (3.400) 


We shall encounter current operators of the form (3.399). First, positron annihilation is an 
important experiment in metal physics (Brandt and Dupasquier, 1981; Mills et al., 1986). The 
form of (3.399) is also used in other contexts. If an electron is destroyed in a filled band, the 
resulting excitation is called a hole. This terminology is used in the filled valence band of a 
semiconductor and also in the core levels of all solids. There is an identity 


C, | electron } = di hole (3.401) 


filled empty 
band band 
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where d is the creation operator for the hole. When taking an electron from a filled band and 
moving it to an empty or partially filled band, then one has a current operator of the form 
(3.399), which is used to describe these electron-hole excitation processes. 


3.8.2. Nonzero Temperatures 


The preceding derivation of the Kubo formula has several restrictions. It was limited to 
zero temperature and transverse electric fields. Both of these restrictions are unnecessary, 
since (3.385) is correct for both nonzero temperature and longitudinal electric fields. At 
nonzero temperature, the bracket is interpreted as a thermodynamic average. The restriction to 
transverse electric fields was actually unnecessary. One could use a gauge wherein the scalar 
potential þ was zero but not the longitudinal vector potential. Then the longitudinal electric 
field can be expressed in terms of the longitudinal vector potential. The derivation is the same, 
step by step, with the same answer. 

Another derivation is presented which is valid at nonzero temperature. A time-varying 
perturbation H(t) is put into the system at nonzero temperature. The central question 
concerns the degree to which the thermodynamic averaging is influenced by the time-varying 
interaction: is H’ put into the thermodynamic weighting factor exp[B(Q — H — H’ + vN)] as 
well as into the time development of the operators? As an example, is the following equation 
a correct expression for the time development of the current operator? 


JO È Py {ehO-H-H'O+WN) gill +H Oljg-iMH+H'O)} (3.402) 


Although this expression is reasonable, it is believed to be wrong. It is incorrect to just put H’ 
into the thermodynamic weighting factor. If the time oscillation is fast enough, the heat bath 
will not follow the oscillations. The oscillatory field causes Joule heating, but that contri- 
bution is proportional to the square of the electric field, and is not part of the linear response. 

The above formula may be manipulated by moving the right-hand operator in the trace to 
the left, and by expressing the time operators as a time-dependent density operator: 


JË = Trip(ty] (3.403) 
p(t) = oH +H’) B(Q-H-H')+uN) itl H+H) (3.404) 


However, since the middle exponential factor also contains the operator combination H + H’, 
it commutes with the time operators, so that the above predicts 


p(t) = eM@-H-H'@+uN) (3.405) 


The time dependence of the current operator plays no role if the above expression is correct. It 
was derived by assuming H’ goes into the thermodynamic weighting factors. Since our 
intuition is that the time development of the current operator should not cancel, the above 
ansatz is incorrect. 

Kubo’s derivation of the time-dependent density matrix proceeds in the following 
fashion. The density matrix pọ = exp[B(Q — H + pN)] applies to the equilibrium system in 
the absence of H’(t). It is assumed that the system is described by this density matrix at the 
initial point in time, which is taken at t = —oo. The perturbation H’ is adiabatically switched 
on as the system is brought forward in time to the present. Now the time-dependent density 
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matrix, for the interacting system including H’(t), is called p(t). It obeys a Heisenberg 
equation of motion, which when solved yields an expression for the electrical current: 


d 
7 Pit) = iH + HO), pO] (3.406) 


JP = Tripa] (3.407) 


Kubo’s starting point is tantamount to not including H’(t) in the thermodynamic weighting 
factor. The density matrix is defined as p(t) = pọ + f(t), where pọ is the density matrix in the 
absence of H’. The equilibrium density matrix pọ is time independent so that the above 
equations simplify to 


d 
[H, po] = 0 (3.409) 


The objective is to solve for the term in J, which is proportional to H’, which is treated as 
infinitesimal. Since f is proportional to H’, it follows that f is small. Terms proportional to 
O(H’)’, such as [H’, f], are neglected. 


d 


iZ f = [Hf] + [H', po] (3.410) 


This equation is solved by moving the first term on the right to the left: 


iE f — H, f) = IR, pol (3.411) 


and then expressing the left-hand side as 
uH); d o iH pin h itt a / 
e AG e" )he =i7 f— lf] = 1, Pol (3.412) 


The linear differential equation may be integrated to give 


. d i —i i / —i / 
i (el fe) = HIR, pole™” = [H'O Pol (3.413) 
t 
f(t) = f(—%) - emt | dt' [H' (t), polje" (3.414) 
—oo 
The interaction is switched on slowly in time, so that at t = —oo there is no interaction, and 


H’ = 0. Of course, then f(—oo) = 0, since this term only exists when H’ exists. A solution 
for f has been derived which is proportional to H’, which is adequate for a description of 
linear response. 

The evaluation of (3.407) is 


JP = Tripo jd + TH S Oa] (3.415) 
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where the first term on the right vanishes (Tr[po j4] = 0) since there is no current if there is no 
external applied field. The expectation value of the current, proportional to H’, is the 
remaining term: 


J” (r, t) = -ire ett f dt'[H'(t’), pl ej) (3.416) 
— -ml f dt'{H' (t), poleje} (3.417) 
= -md | dt'|H'(t’), Polja, D| (3.418) 


The quantity in curly braces was rearranged by using the cyclic properties of the trace. The 
factors on the right are just the time development of the current operator: by using the cyclic 
properties of the trace, the three operators can be rearranged: 


Tr[H" (r), Pol JQ, t) = Trl'(t') po ja, t) T PoH (CYT, t)] 
= Tr{pol ja, OAC) — HCY, ÐI} 
= Tr{Polja(r, £), H'O) (3.419) 


The term in curly braces is just a commutator. The final result is derived for the expectation 
value of the current, which is just the thermodynamic average of the commutator of H’(¢’) and 


jal, t): 
IQ, t) = -i l | dt ([ jar, t), H'(t’))) (3.420) 


Equation (3.420) is precisely the zero-temperature equation which was derived earlier in 
(3.381). It leads, in the same way, to the Kubo formula (3.385). Previously the angle brackets 
meant to take the expectation value in the ground state at zero temperature, while now they 
mean a trace over the thermal distribution at nonzero temperatures. Equation (3.385) is valid 
at nonzero temperatures. 


3.8.3. Zero Frequency 


The Kubo formula (3.385) has been shown to be valid at nonzero temperatures and for 
transverse electric fields. Also, it was suggested that it is valid for longitudinal electric fields. 
However, the purist might still object. All the derivations introduce a frequency œ, and the dc 
conductivity is obtained by letting œ — 0 at the end of the derivation. Is it possible to provide 
a derivation of the dc conductivity which assumes a constant electric field throughout the 
derivation? Of course, the answer is affirmative, which provides the motivation for the next 
derivation, using a method suggested by Luttinger (1964). 

Begin by assuming an electric field Eọ which is static in time and constant in space. It 
causes a scalar potential 


(r) = -Ep ' r (3.421) 
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and the external vector potential is zero. This scalar potential introduces a perturbation term 
into the Hamiltonian which is 


eF = e" l o(r)o(r) (3.422) 


where p(r) is the charge density operator of the system. A factor of exp(st) was added to this 
term so that it vanishes as t —> —oo. This factor represents the switching on of the potential as 
time develops from the past. It is assumed that the perturbation on the density matrix, 


P = Po + fe" (3.423) 


also has this term. The first term pọ = exp[B(Q — H)] is the density matrix when F is absent. 
The perturbation fe” is due to F. The starting point is the equation of motion for the density 
operator, 

OP 


i = [H + eF, p + fe") (3.424) 


which is solved as before. The term proportional to the perturbation F is given in (3.414): 


t t 
fe = -i | dt'e ™ e" TF’), pole” = -i| dt'e" [F(t — t), pol 


— 00O 


It is convenient to change integration variables to f — t > —t: 


f= -i| dte-"|F(—2), pol (3.425) 
0 
The crucial trick is to write the following identity for the commutator: 
P gd 
LF). pol = ipo | dB F- iB) (3.426) 
The identity 1s easy to prove, 
P ð P ð 
d®’ — F(—t — if’) = —i dB' — F(_-t — if’ 
| ap SFr iB) =—i| aB Fr iB) 


= —i[F(—t — iB) — F(-9) 
so that 
? polF(—t — iB) — F(—2)] = -polh F(t)" — F0] 
= —[F(—2), pol (3.427) 


The expectation value of the current is the expression 


(3.428) 


CO B +}; / 
Jr) = Trif] = — | dte | ap'T| po oF ta iB) 200) 


a 7 


The reason that the trick identity is introduced is that one can now do the following 
manipulations to produce the electric field. First, the time development of F(—t) is given by 


F(-t) = | Pro —t)o(r) (3.429) 
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The scalar potential is time independent. The time derivative of the density is related to the 
current through the equation of continuity: 


oT l Pro) PE (3.430) 
— [rooy - j(r, —t) (3.431) 
Integrate by parts on the space variable, and thereby obtain the electric field Ep: 
= | @rice —t) + Vo(r) (3.432) 
— | Price —t)+ Ey (3.433) 


oo B 
n= | ae | aB'Tr po | @ rio, =- iB )Eog jale) 


Since the electric field is a constant, it can be removed from the integrals, and the conductivity 
is given by 


oo B 
Re(o,) = | dte™ | ap Tr| po [erie -t — By] 


The right-hand side appears to depend on the special variable r. This dependence is elimi- 
nated, as before, by averaging over the volume v of the sample. This manipulation just 
introduces the q — 0 limit of the current operator j (q) 


lim j,(q, ) = j,(t) = | ert À (3.434) 
q—>0 


The Kubo formula for the dc electrical conductivity is 
1 (* P 
Re(o,g) = a dte l aBTr| po [éri — Byl (3.435) 
0 0 


The result does not contain the frequency. The elimination of œ has been achieved by paying a 
penalty: there is a second integration, this time over the temperature-like variable B’. The 
second integration makes the results appear different than the prior result (3.385). The two 
results are identical for œ = 0. To prove this assertion, again introduce the representation 
In), |m}, which are exact eigenstates of the Hamiltonian H. The important matrix element is 


(n| jg(—t — iB’)|m) = (nje “EEB j eH mn) (3.436) 
= (n| jglmye "E -Em eb En—En) (3.437) 


so that 


Re(O,g) = Ref De e | "(n| Jglm)(m| jan) 


CO 
x | dte" E Enis) f dh'l E 9 (3.438) 
0 
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Both integrals can be done easily. For the real part of the conductivity, the only complex term 
is the ¢ integral: 


© . , 1 
R dt —it(E,, —E,,—is) — R es E _ 
e| | e e E —E, —is) no(E, — Em) 
p 
òlEn — Ey) | dp'e? Fn) = BEE, — En) 
0 
which makes the integral over P’ easy. The final result is 
TB —BE . 
Re(0,8) = 7 ue "(n| jalm) (m| j,|n)o(E, — Em) 
This equation is identical to the earlier result (3.396), since į = ją for q = 0. It has been 


proved that (3.435) is identical to the œ = 0 limit of the usual Kubo formula (3.385). In the 
limit of œ = 0, the longitudinal and transverse conductivities are identical. 


3.8.4. Photon Self-Energy 


A formula which is identical to the Kubo formula can also be derived from the photon 
self-energy. Consider the photon Green’s function (3.78), 


D (K, t) = —(T,A,(k, t)A\(—k, 0)) (3.439) 

and treat as the perturbation 
y= -2 |r ho A(r) + ents) AC) i (3.440) 
= -FEW A(k) + D AM): A(—k) (3.441) 


The A(k)* term contributes a self-energy of TI") = nge?/m. The other term in the self-energy 
comes from the n = 2 term in the expansion for the S matrix for the j - A interaction, 


1af, f ae nt (ee! 
Ty l ar, | dt (Ti (kK, TYm , T2)) (3.442) 
V Kk” 0 0 


x (T.Ay(k, JAIK’, t1)4,(K", 12)4,(—K, 0)) 


— 3 [ dt, f di, DN) (k, t — T1)Mm(k, T1 — 12)DPKk, T2) (3.443) 

where 
Mn(K, ti — 12) = — È (T, j(K, t1 YK 7) (3.444) 
-E (T, Hk, tnk, 12) (3.445) 


The Fourier transform of this expression gives the self-energy contribution as 


II; (k, i@,,) = -Lf dten" (T ji Vk, Tt) (k, 0)) (3.446) 
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Recall Eq. (2.186) for the dielectric function, in retarded or Matsubara notation: 


e(k, io,) = 1 — on, Ti io) + ne] (3.447) 
ea(k, ©) = 1 — an Ti o) + me (3.448) 


where II is the scalar part of the self-energy IT,, for homogeneous, isotopic systems: 
l 
II(k, iœ) = 3 > TI ,.(K; 10, ) (3.449) 


The long-wavelength dielectric function is given by the conductivity 


470(@) 


(3.450) 
10 


e(o) = lim Eret(K, @) = 1 — 


Comparing (3.448) with (3.450), the expression for the conductivity o(w) in terms of the 
retarded correlation function ITI(o) is 


oo) = - Ti) 4 me] (3.451) 


Since II is the current—current correlation function, Eq. (3.388) is rederived. 

The important point is reemphasized that, in evaluating the Kubo formula for the 
current—current correlation function, do not include the self-energy terms which arise from 
the vector potential A(r, t). These terms have already been included in the formalism, 
indirectly, by changing the external field E‘*” to the total field E. They do not reenter the 
formalism again except as internal interactions within the polarization bubble. 


3.9. OTHER KUBO FORMULAS 


There are other measurements besides electrical conductivity. They require additional 
correlation functions or Kubo formulas. Some of these will be derived here. The first is 
magnetic susceptibility. Next comes the derivation of transport coefficients such as thermal 
conductivity or thermoelectric power. 


3.9.1. Pauli Paramagnetic Susceptibility 


The magnetic susceptibility of a solid depends rather strikingly on whether the system is 
already spontaneously magnetized. If it is, then an additional weak, external magnetic field 
may not change the magnetization appreciably—there may be no linear response. This case is 
not considered, and it is assumed the solid is not in a magnetic state; it is not ferromagnetic, 
antiferromagnetic, etc. A magnetic state has long-range order, although there is usually short- 
range ordering, from the interactions, even in the absence of long-range order. 
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The perturbation term in the Hamiltonian for Pauli paramagnetism has just the inter- 
action with the electron spin: 


V = —m- He ™ (3.452) 
H(t) = Hye | (3.453) 
Mm, = SUo 2 Sia (3.454) 


where H(t) is the ac magnetic field and m is the magnetization, which is the sum over all spins 
Sj at position i with vector direction «. The other factors are the Bohr magneton pọ = 
le|4/2mc and gyromagnetic ratio g œ% 2. Another contribution to the magnetization is the 
orbital, or Landau, contribution, which is not discussed here. In Pauli paramagnetism the 


objective is to evaluate the magnetization, 
M,(t) = (m, ®©) = Xapp (3.455) 


for terms linear in the magnetic field. The derivation proceeds exactly as in the preceding 
section. In analogy with (3.381), the magnetization is a correlation function 
t 
My) =i] dd (img, VO) (3.456) 


— OO 


and the magnetic susceptibility is 


149) = | dt e= Em (e), ml) (3.457) 


Equation (3.457) is just the Fourier transform of a retarded correlation function. It may be 
evaluated conveniently in the Matsubara formalism. Define a correlation function 


B 
Xapi) = | dre" (T,m,(t)ma(0)) (3.458) 
0 


and (3.457) is just the retarded function obtained by the analytic continuation im, — œ + ið. 
The spin operators are often written in terms of raising and lowering operators: 


SO = S, iS, (3.459) 
SY = E Ch, Cu (3.460) 
k 


The three operators SP, SO, S, are used instead of S,,S,,S,. Of course, susceptibility 
functions can also be constructed using these operators. 

The correlation function will be evaluated for a simple example. It is the Pauli 
susceptibility of a free-electron gas at zero temperature. The answer is first derived using 
simple considerations. In a magnetic field H in the z direction, the spin-up and spin-down 
electrons have the following energies and occupation functions 


E(B) = Ép — HoH. m = nrp — oko) (3.461) 
E(P) = Sp + Hoo, n, = ne(Sp + HoHo) (3.462) 
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The net magnetization will be the difference between the number of spin-up and spin-down 
electrons. The occupation numbers are expanded in a power series in the magnetic field. 
Keeping the first two terms gives 


M; = Bo Xn} (p) — n, (P)] = -2p Ho Dae) (3.463) 
P p p 
M. 
Xp = lim 7 = 2119 X (Ey) = KONF (3.464) 
p 


The Pauli susceptibility %, is given by the square of the Bohr magneton py times the density 
of states Np at the Fermi energy. This density of states is the total for both spin components. 
The notation is confusing, since often the same symbol is used as the density of states for 
each spin component, so that the value is only one-half as large. 

The same result is obtained from the correlation function. The operator m, is given in 
terms of creation and destruction operators as 


m: = Wp X OCE Cy, (3.465) 
po 


where o = + is used to denote electron spin up or down. The Kubo formula for the corre- 
lation function for the static susceptibility (iœ = 0) is 


B 


Xzz = n| at »» 50! (T,Chg(t) Cio (t)Chg(0)Cy,/(0)) (3.466) 
0 pk,oo’ 


For the noninteracting system, the Hamiltonian is just the kinetic energy term, so that 
Ct (T) = thx Ch. and C,,(t) = e7«C,,. Then the operator Ch (t)Cyo(t) has no t depen- 
dence, so the t integral just gives B. The combination of operators can be averaged directly. 
Some care must be taken with the terms which have o = o’ and p = k. Using the fact that 
(n?) = (n,) if both refer to the same spin state, the answer is 


Xe = a0 on + X oonMnmt+ X oc) (3.467) 
po p#k,oo’ p,o4o’ 
2 
= wB Y o (np — n?) + (x on (3.468) 
po po 


The second term on the right vanishes, since there are as many up spins as down spins in 
equilibrium. The correlation function is evaluated, of course, in the absence of any external 
magnetic field. There remains the combination of operators 


er = HOB Do np(1 — ny) = ug (2e) = 215 > 6(E,) (3.469) 
po po Ep po 


This formula has exactly the same combination of factors as obtained previously, which 
confirms the result of the simple derivation. 
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Another method of evaluating the correlation function is to define the wave-vector- 
dependent susceptibility x(q, iw) by 


Xap(q, 10) = f dre™(T,m,(q, t)mg(—q, 0)) (3.470) 
m,(q) = Ho 0 ChagoCpo (3.471) 
m,(q) = BEG "at Cou + Cira Cpr) (3.472) 
m,(q) = BUG Coy — Chg Cor) (3.473) 


Evaluate y,, for nonzero values of q and iœ = 0 and formally take the limit q — 0 at the end 
of the calculation: 


Xz2(4, 0) = T 3) SS f a dī( T Chg s(T)Cp, Ca, «OVC, s (0) 
pk,ss’ 0 


For noninteracting electrons, this correlation function is nonzero only if s = s’, and then it is 
identical to the polarization operator P“)(q, 0) which has been evaluated previously: 


— nn 


p 3 7 Seta 


Now take the limit of q —> 0 on the right-hand side. There results dn-(€,)/06, in the 
summation over p. All methods give the same result for the noninteracting electron system. 


3.9.2. Thermal Currents and Onsager Relations 


There are two conceptual difficulties in deriving a formula for the thermal conductivity. 
The first is that one must put a temperature gradient on the solid. However, all of our thermal 
averaging assumes a constant temperature. Of course, the thermal conductivity is still defined 
in the limit that the temperature difference AT goes to zero, so that the correlation function 
can be evaluated at a single temperature. But the problem lies in formally deriving the 
correlation function without assuming that AT exists somewhere in the formalism. Another 
way to express this difficulty is to consider how one raises the temperature by heating the 
sample. The heating changes the energy of the system, so that one might try to calculate the 
time rate of change of the energy. But the energy operator is the Hamiltonian, and its rate of 
change is H = i[H, H] = 0, which does not help much. This problem has been well studied in 
nonequilibrium statistical mechanics (de Groot, 1952, 1962), and it has been found that the 
important quantity is the rate of change of the entropy. 

Another problem with the thermal conductivity is that there are several definitions of 
heat current. We shall mention three. The same definition of experimental quantities is 
obtained for the thermal conductivity, thermo-electrical power, etc., regardless of which 
definition is adopted. But one must be careful to do the right calculation for each definition of 
heat current. 

In linear response, there are current J; which flow as a result of “forces” X, on the 
system. These forces might be temperature gradients VT, or electric fields E = —VV, or 


178 Chap. 3 e Nonzero Temperatures 


concentration gradients which are expressed as gradients of the chemical potential Vu. Linear 
response assumes these are proportional: 


J=} Z-X, (3.475) 
j 


The coefficients Z are the measurable constants which are defined in terms of correlation 
functions. There are also Onsager relationships, which specify that Z” = Z™. A moment’s 
reflection will show that the Onsager relationship is not valid for any arbitrary choice of 
currents and forces. For example, if they are valid for a force V(1/T) = —(VT)/T’, then 
changing the force to VT, so that 1/T? is absorbed in Z, means the Onsager relation is no 
longer valid. There must be a criterion for choosing the forces and currents, which has been 
given in nonequilibrium statistical mechanics by de Groot (1952). In a nonequilibrium 
process, there is a net generation of entropy, so that 0S/dt > 0. Here S is just the part of the 
entropy which is generated irreversibly. By requiring that entropy generation be expressed as 


as 
75 3 J,- X; (3.476) 


then the Onsager relations are valid. There is not a unique set of currents and forces, and 
many possibilities satisfy (3.476). Each choice defines a different set of coefficients Z™, but 
each set obeys the relationship Z = Z®. 

The symbol J will refer to particle current. There are several definitions of energy 
current. The discussion will follow Barnard (1972). The first energy current which comes to 
mind is the one called the energy current Jp. For a free-particle system it is just the velocity of 
the particles times the energy times the number: 


jz = $. VpEpřp (3.477) 
po 
Jg = (jz) (3.478) 


Other definitions were given in Sec. 1.2 for particles which were not free. For a system with 
only these two currents, their forces are (Luttinger, 1964) 


J =J, X= -fvr -v($) 


l 


where e is the charge of the particle. The rate of entropy production is 


3 g. févr +v(E)] +Je- v(7) (3.480) 


and the linear response equations (3.475) are 
_ anf e H (12) l 
J, = -MG (É Vey + V5(=) | + MPVs (=) (3.481) 


Jea = -MG | EVpV + Vah) | + MPV (7) (3.482) 


With these definitions, the Onsager relation is M VE = Ms”. 
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The trouble with (3.477) is that the energy current is not the heat current. As discussed 
by Taylor (1970), the heat current should be 


Jo = Jg — yJ (3.483) 
which for free particles has the form 


jo = }_ Yp”p(Ep — u) (3.484) 
po 


so that the reference energy is the chemical potential u. The latter definition makes sense only 
for particles with a positive chemical potential, e.g., electrons in metals. If one takes an 
electron below the chemical potential and moves it down to the other end of the sample, one 
has moved energy. But when the electron arrives at the new location, it finds locally a filled 
Fermi distribution, so that the only states available to it are above the Fermi energy. But the 
energy gain required to increase its energy, to get it above the Fermi energy, must come from 
the surroundings. The energy gain from the surroundings must cool the locality, so that the 
electron has brought coldness with it. Only electrons which arrive with energy above the 
Fermi energy, and give energy to the surroundings, bring heat. 
With this choice of thermal current, the forces are 


l 
J =J, X; = —— VĮ 
l 1 T H 
l 
J, = Jo, X% =V| -= 3.485 
at, x= v(2) 2.85 
where jj = p + eV. With this choice, the rate of entropy production is 


as I _ 1 


and the linear response equations are 


l l 

J, = -zLap Vol + Lop Ve (=) (3.487) 
1 

Jou = = ELS Voi + + LV, (=) (3.488) 


where L? = LÊD, It is easy to show that these equations are consistent with the set (3.479) 
and (3.481). In (3.479) regroup the terms, 


e u e l I) 1c. 1 
VV +V¥(=) =ŹVW + mVn +uv(7) = mv + uV(=) 


so that dS'/dt becomes 


as 1__ 1 1 
Sg. [pva + iv(2)| tJe v(7) (3.489) 


1 _ l 
= -73:VÄ + Øs- 1): V(7) (3.490) 
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which is identical to (3.486). Similarly, in order to find the relationship between the coeffi- 
cients M®, and L®’, operate on the current equations (3.481) with the same regrouping 


n=-7 Mig Veit + (M? — uM Vs( 7 7) (3.491) 
Jey = = MPV pf 4 ME- uM V5 (7) (3.492) 


Next, consider the following combination of these equations: 
Jos = JE a — Wy = -1M — Msp Veit 
+ ME — pM? — uM + MSV A(z F) (3.493) 


This equation for Jp, now has the same form as the currents in (3.487) which gives the 
relationships 


LOD = MEP (3.494) 
Le) = 10%) = MO?) — pd (3.495) 
L°? = M@) — 2uMO?) + p? MOY (3.496) 


A third possible definition of heat or energy current is the choice 


Here one includes the fact that the potential V(r) may not be constant in the sample, so that a 
charged particle which moves will acquire a change in potential energy. Determining the 
forces which go with this choice of energy current is a homework assignment. 

The thermal conductivity is usually measured under conditions of no particle current 
J = 0. From (3.481), this condition leads to 


e H\ M (12)  /] 
aw +V(=) = Tan = (3.498) 
which for the energy current gives 
i (MCD 
— _ (22) _ 
n= v(i) 3.499) 


The M tensors are being treated as scalar quantities, which is permissible if all the forces are 
in the same symmetry direction. Since the thermal conductivity K is usually written as 


J; = -KVT (3.500) 


] (M?) 
—— | MC? _ 
K=5 m “fu (3.501) 


Equations (3.475) and (3.476) and the constraint dS/dt > 0 can be used to show that the 
right-hand side is always positive (Kubo, 1959). If there is no particle flow, then J o = Jg. The 
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thermal conductivity can be found from (3.487) in a similar fashion. The restriction that no 
current flows gives: 


lo- LY) _/1 
The heat current and thermal conductivity are 
] (LO)? 
— vl re?» 
Jo = v(7) p TUD (3.503) 
K= l LED (LO?) 3.504 
=| ~ Tap (3.504) 


It is easy to use the identities (3.495) to show that this definition is consistent with (3.501). 
The electrical conductivity is usually defined when there is no temperature gradient 
VT = 0 and no concentration gradient Vu = 0: 


e? 
o= al (3.505) 


The coefficient L!” may also be measured, since it is the thermoelectric coefficient. If a solid 
has a temperature gradient AT, no particle currents J = 0, and no concentration gradients 
(Vu = 0), then a voltage difference AV is measured which is proportional to AT and is given 
by 

l LOD 
~ eT LAY 
This equation is the definition of the thermoelectric coefficient S, which is also called the 
thermopower: 


AV = AT (3.506) 


AV 1 L) 
AT eTLOD 
The thermopower S may have either sign. The sign usually indicates the sign of the charge on 


the conducting particle. The correlation function is also related to the Peltier coefficient 
(Barnard, 1972). 


S= (3.507) 


3.9.3. Correlation Functions 


The coefficients M®’, and L, are correlation functions of current operators. They are 
expressed by formulas similar to those of the preceding section. One wishes to obtain the 
measurable currents J; by evaluating the expectation value of the operator j; of the same 
quantity: 


J; = (j) (3.508) 


The derivation is identical to the steps used to get (3.428). One introduces a term F into the 
Hamiltonian, and this term causes the change in the density matrix. In direct analogy with 
(3.428), the result is 


ore) B 3 
J, = | dte | ap'T| po zC- 0o] (3.509) 
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The only remaining step is to make the proper choice for dF’/dt. It is the time rate of change of 
the energy of the system. At least it is the dissipative part brought on by the transport and 
forces. It is the heat production or equivalently the irreversible rate of change of entropy S: 


aF dQ _ 0S _ 
y= gly pEi X; (3.510) 


The small j; symbols are for current operators, not their averages. Using this result for 3F /əðt 
in (3.509), 


0° B 
y=—5| de” | apts fo Ljet — Xabi 3.511) 
B Jo o 


Since the forces X, are constants, they may be removed from the integrals. Then J; is 
proportional to X;, and the constants of proportionality are just the transport coefficient: 


l 9 —St p / . | NDIN > 
w -z dre~ | dB'Tr [pp jre(—t — iB’), (0) (3.512) 


Equation (3.512) is a very simple result. The transport coefficient between force X, and 
current J; is just the current—current correlation function of j; and j; For example, the 
coefficients in (3.487) are 


1? p 
Lp = -5l are | dB'Tr [Poja(—t — iB’) jg] (3.513) 
0 0 
12 1 f° p 
Lip — -3l are | dB'Tr [Pojo (—t — iB’) jg] (3.514) 
0 0 
22) 1 f° p / 
Ls =-5| are | dB'Tr [Po jo.a(—t — iB) jop] (3.515) 
0 0 


The M“ coefficients in (3.481) are similar; the only difference is that jz is used instead of jo. 
Since jg = jg — pj, it is easy to show that the relations (3.495) are automatically satisfied. It is 
also easy to prove the Onsager relations. First one writes the correlation function in the |n) 
and |m) notation of (3.396): 


Z = Ye Pn (nl jilm) (m| jln)5(E, — En) (3.516) 
Z = mye Pn (n| jilm) (m| jy|n)S(E, — En) (3.517) 


These two expressions can be shown to be identical by interchanging dummy summation 
variables n and m and using E,, = Em. 

These correlation functions can also be evaluated by using the Matsubara formalism. 
One just evaluates the correlation function 


VAG 


B 
gh, BATIO (3.518) 


n 


by the usual S-matrix techniques. After a suitable result is obtained, one gets the retarded 
function by letting im, —> œ + id and then zero frequency by letting œ = 0. It was shown in 
Sec. 3.7 that these steps are identical to the evaluation of (3.512). 
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PROBLEMS 


1. Prove the following moments of the spectral functions: 


O o) = —(H, CJH, Cpo)) (3.519) 
f =~ ong(0)B(Q, o) = (A(q)[H, A(—q))) (3.520) 


Evaluate these moments for noninteracting particles and phonons. 


2. Take the Fourier transform of the correlation function (3.1). Define its spectral function as C(p, œ). 
Show that 


A(p, œ) = (1 + e-®*)C(p, œ) (3.521) 


What is the result when (3.1) is time-ordered? 


3. Let U(t) be the density operator p(q, t) of wave vector q. Prove that 


s(q) = | 2 n(0)5(4, ©) (3.522) 


—Xx 


where S(q) = (p(q)p(—q)) is the pair distribution function and S(q, œ) is the spectral function for the 
density—density correlation function. 


4. At zero temperature, 


(a) Show that n, = [°.(d@/2n)A(p, @). 
(b) Evaluate this integral for a Lorentzian: A(p, œ) = 21°, /[(@ — E) + Ts. 


(c) Plot np vs. §, for cases (1) Tp = To = constant, and (2) Fp = Tolépl/ JT + ES. 


5. Let ci be a creation operator describing a spinless boson whose number is conserved; this model is 
sometimes applied to “He atoms if one can ignore the internal structure (excitations, degrees of 
freedom). The particles will have a chemical potential u. Define 


G(p, t) = —(T,ePPRP-C, (x) Ch(0)) (3.523) 


(a) Show that G (p, io) = 1/(iw — Ep): 
(b) What is the definition of the retarded function? 
6. Prove the following general theorem for particles which are fermions or bosons: 
OO 


d | 
Galp, t) = —iO(¢) l =— Ap, oe (3.524) 


7. Evaluate the thermodynamic average for the case where the operators represent bosons 
(Salona Eaha) (3.525) 
p 


by doing it (a) directly and (b) using Wick’s theorem. Both should give the same answer. Terms with 
p = k have to be treated carefully. 
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8. Use the rules for constructing diagrams to write down the two-phonon self-energy diagrams of the 
electron: E” (p, ip,). 


9. The two diagrams in Fig. 3.12 contribute to the electron self-energy and arise from the electron— 
electron interaction. Use the rules for constructing diagrams, and write down these contributions in the 
Matsubara formalism. However, do not evaluate them, i.e., do neither the frequency summations nor the 
wave vector integrals. 

10. How is (3.215) altered if the internal line is a Coulomb interaction? 


11. Derive the summation formulas (3.217) and (3.218). Also evaluate 


l 
-Y Dg, ia,)D (k, im, + ign) (3.526) 


ID, 


12. Evaluate the Matsubara frequency summation for 


1 
B> GO (P, ipn + idm)D(A, 14m) (3.527) 
ldm 
13. Prove that 
2 
Qo = lim, > > e”" InfG(p, ip, )] (3.528) 
+01 Bopp, 


where G is for electrons and Qp is the free-electron part of the thermodynamic potential (Luttinger and 
Ward, 1960). 


14. Consider the terms W, for the electron-phonon interaction. How many times do each of the terms 
appear: UZ, Už, UU, U,U3? 


15. Evaluate W,(p, t) in (3.314) using V(t) as the electron—electron interaction. Which terms would 
not make sense if one tried to write this as exp(F)? 


16. Show that at nonzero temperatures the real part of the conductivity can also be written as 


_ e-bo poo 


1 
Relo = —| dre" Na, dila 0) (3.529) 


17. Find the forces X; which go with the currents J, Jy, in (3.497). What are the Kubo formulas for the 
transport coefficients? 


18. Show that the relations (3.495) are consistent with (3.512). 


FIGURE 3.12 
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19. Show that the second-order phonon self-energy, from the electron-phonon interaction, may be 
written as II(q, œ) = M¿P® (q, œ): 


dP 1 l 
Og. aw =2 |n r 
P (q ) (27) p © — Ep + Ep+q +16 © + Ep — Ep+q +16 


Evaluate this expression by doing the wave vector integrals. Assume a free-electron gas of Fermi 
momentum kp at zero temperature. Give both the real and imaginary parts of P®. 


20. Derive the change in the thermodynamic potential when the electron self-energy is E(n) = NXp. 
21. Derive the change in the thermodynamic potential when the electron self-energy is L(y) = n? Ep. 


22. The thermodynamic potential ©, internal energy U, and entropy S are related by Q= 


U — TS — uN, where 
dQ 
S = -(5) | (3.530) 


dS 
C =r() (3.531) 
4 aT) y 


Derive the expressions for S, U, and Cp for the free-electron gas. Show that the specific heat is 
proportional to T at low T, and find the constant of proportionality. 


23. Derive the relationship between the total internal and external electric fields by solving Maxwell’s 
equations. 


24. Show that Wick’s theorem is invalid when Họ contains particle—particle interactions. As an 
example, evaluate the correlation function W below for a system of two states. Do it exactly and also by 
Wick’s theorem. 


Hy = 3 E CÌC, + Unn, (3.532) 
a=] 
W = XTC) CaCl) Cpl) (3.533) 


25. Derive the equilibrium real-time Green’s functions in (3.327) starting from (3.322). 


26. In one dimension find the Wigner distribution function for the simple harmonic oscillator of mass 
m and frequency Wp. In center-of-mass coordinates show that its spectral function is given in terms of 
Laguerre polynomials 


A(k, œ; X) = 4re™^ E (1) L,2A Sio — oln + 4] 


2 
A=% 4 RP, P= 
[2 m 


27. Prove the following identity: 
OO d (’ 


Zelk, 0; T) = — | oo Ti © o; T) — x*(k, wo’: TJ] 


This result is easy to prove in equilibrium. Can you also prove it for systems out of equilibrium? 


Chapter 4 
Exactly Solvable Models 


Every many-body theorist should be knowledgeable about the available exactly solvable 
models. First, there are not many of them. Second, they are useful for gaining insight into 
many-particle systems. If the problem to be solved can be related to an exactly solvable one, 
however vaguely, one can usually gain some insight. 

Several models are presented here to introduce the discussion of Green’s functions. 
Usually, the model is first solved without Green’s functions and again with them. Models 
which are exactly solvable may be solved by a variety of techniques. 

The models presented here are only a small sampling of the possible exactly solvable 
ones. Others have been presented by Schweber (1961). Many one-dimensional models may be 
solved exactly (Lieb and Mattis, 1966; Baxter, 1982; Mattis, 1993). 


4.1. POTENTIAL SCATTERING 


A very simple problem is an impurity potential V(t) in an otherwise free-particle system. 
All other interactions are ignored except that of the free particles with the impurity, which is 
assumed to be at the origin. The potential is assumed to have no internal structure: spin, 
excited states, etc. It is a simple function of position r of the particle from the origin and is 
spherically symmetric. 

The wave functions , and energy £, for each particle may be obtained by solving a one- 
electron Schrödinger equation: 


h2 
AW, = 3 + re, = &Wy (4.1) 


The solution may be accomplished by numerical means if the Schrödinger equation is not 
solvable analytically. 


In many-body theory, the impurity problem is usually encountered as a scattering center. 
The free-particle states are plane waves, 


1 


y(r) = i 2 Choe (4.2) 
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The spin index o is dropped, since spin effects are unimportant. The Hamiltonian is expressed 
as operators 


| 
H =Y &CtCk +- F Viw Ch Cy (4.3) 
k V kk’ 
Nia = [rro k-k) — V(k — k’) (4.4) 


The last term is the potential scattering of the free particles. The object is to diagonalize the 
Hamiltonian (4.3). Of course, the solutions are given by (4.1). However, the problem is not 
entirely solved. Since the equation is second order, it has two solutions. These may be chosen 
as the ingoing and outgoing waves. Alternately, these two may be combined to give standing 
waves. These choices must be related to the scattering problem implied in (4.3). 

The discussion will benefit from the knowledge that the solution is of the form (4.1). 
Consider the boundary conditions on the wave function. The integral equation for the wave 
function is: 


VaCt) = o4(0) y Out) 2 fas OE )V rye) (4.5) 

phr) = =e" (4.6) 
27,2 

E; = E (4.7) 


This form of the integral equation is valid for the free-particle states with energy &,. The 
Schrödinger equation (4.1) may also have bound states. They shall not be discussed explicitly. 
They obey an equation similar to (4.5), but the energy is changed to the binding energy 
E} —> —Ep (Eg > 0), and the >, term on the right is absent. 

The first step is to prove that (4.5) is equivalent to (4.1). Operate on both sides of the 
equation by Hy — & 


= (r) 


— Ey 


x lone 


(Ho — & Wy = (Ho — &) OK + ys 


and use the fact that 
h? o» 
CT a ET, (4.8) 
m 
to get 
Hy = oe =- E eO [Pr OOO) (4.9) 
k’ 
The completeness relation for the summation over the set of states is 
Doe Moy lr) = 5(r — r’) (4.10) 
k’ 
(Ho — & Wx = -V (rhil) (4.11) 
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If the potential is transferred to the left, the final equation is 
(Ho +V— €,)W,.(r) = 0 (4.12) 


which is the desired answer. The above equation is precisely (4.1). It has been shown that 
(4.5) is equivalent to the usual Schrödinger equation. Next discuss boundary conditions. The 
differential equation (4.1) is second order, so there are two independent solutions. As a 
mathematical problem, these two solutions may be combined in any possible way, depending 
on the choice of boundary conditions. As a physics problem the two solutions are combined 
to give a desirable physical situation. One choice is to have the wave function a standing 
wave. This choice leads to the reaction matrix equation. Other choices are to have the wave 
function an incoming wave or an outgoing wave, which leads to matrix theory. From 
scattering theory and causality a natural choice is the outgoing wave. It has the asymptotic 
form exp(ikr)/r at large distance. This choice of phase [rather than exp(—ikr)/r] arises from 
the physics convention that the time development is exp(—iœt). The integral equation (4.5) is 
a convenient starting point for this discussion of boundary conditions, since the various 
choices of standing, outgoing, or ingoing waves are determined only by the complex part of 
the energy denominator. The factor is €, — £y, for standing waves, so that the principal part is 
chosen for the denominator. The factor is €, — £y + id for outgoing waves, and £; — €p — id 
for ingoing waves. 


4.1.1. Reaction Matrix 


Here the energy denominator is chosen to be real and given by the principal part. The 
free-particle Green’s function is then defined as 


G,(k,r—r)=P = 4.13 
of ) 2 Ez — Ep V w Ek Ep ( ) 
dk! ik’ - (r—r’) 
(2T) & — & 


The last step on the right takes v —> oo and changes the summation into an integration over 
wave vectors. The integral is standard and gives 


, cos[k|r — r'|] 
G)(k, r — r ) = rp(k) ——— l 
a(k, r= r) = mph) y (4.15) 
A factor p(k) has been introduced which is the density of states of the particles: 
Ëk mk 
p(k) = eae — Ep) = hone (4.16) 


For particles with spin, this factor is the density of states per spin configuration. In electron 
systems, for example, the net density of states is twice this result when spin degeneracy is 
considered. The Green’s function may be expanded as a function of r and r’: 


Go(k, r —r’) = mp(k) 22I + DPG -P Yy mir) (4.17) 
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where the notation r_ and r.. means the lesser and greater of r and r’, respectively. The j,(kr) 
and 1,(Ar) are spherical Bessel functions of the first and second kind, and the P,(cos 0) are 
Legendre functions. These results may be put into the integral equation (4.5): 


W(t) = y(n) + [dro r- rT) (4.18) 


At this point it is convenient to reduce the integral equation to angular momentum compo- 
nents /. A plane wave may be expanded as 


er — Y (21 +1) Pk Pjk) (4.19) 
i 
A similar expansion is used for the actual wave function: 
W(t) = 21 + 1) Pk PR Cr) (4.20) 
i 


The radial function R,(kr) is the quantity to be determined. It satisfies a radial Schrodinger 
equation of the form 


2m 


K? [1 3 ,Əə3R I+) 
hirz- 4 R] +170) - eR =0 (4.21) 


which does not determine the boundary conditions. They are obtained by substituting these 
forms into (4.18). Each angular momentum component is selected by multiplying the 
equation by P,(k +F) and then integrating over all spatial angles. One uses the fact that 
A 4n 
dQ. P(k*?)P_(r* p) == 
[aa PAPE D 
to reduce the equation down to one which involves only the same angular momentum 
component: 


SmPI(k P) (4.22) 


RE) = jkr) + 472 p(k) | dj kr mikr VR (Kr) 
= j (kr) + 4n? p(k) [no | earje YR Ce) 


+ ji) | dy be HDR | (4.23) 


r 


It is important that the potential is spherically symmetric. Otherwise the scattering term would 
mix angular momentum components, which usually makes the equation much harder to 
solve. 

The solution is examined in the limit as kr —> oo. From the differential equation (4.21) it 
can be shown that the radial wave function must asymptotically approach the value 


Ci(k) 
kr 
The prefactor C; is determined below. The asymptotic limit of the integral equation (4.23) is 

lim R,(Ar) —> j,(7r) + Dj(k)ni (Fr) 
EN sin(kr — ln/2) | D, cos(kr — In/2) 
kr kr 


Jim R,(kr) > sin |x + ò (k) — = (4.24) 


(4.25) 
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where 
D,(k) = 4r? p(k) | ? dri (kr) V (DR (kr) (4.26) 


The potential V(r) is assumed to be of short range: It falls off faster than r7? at large 
distances. This assumption is necessary for the integral in (4.26) to be well defined. The 
scattering is then described by a phase shift 6,(k) which depends on angular momentum and 
wave vector. The two asymptotic expansions (4.24) and (4.25) must be identical, which is 
accomplished by setting 


D, = —tan(8,) = 4x p(k) | r drj(kr)V (r)R (kr). (4.27) 
0 


The complicated integral just defines the tangent of the phase shift. Now (4.25) becomes 


Jim R(kr) > 7m l [cos(ò) sin(kr — ln/2) + sin(d) cos(kr — In/2)] 


cos(d) 
sin(kr + 6, — [nt /2) 

a t 4.2 

kr cos(6,) (4.28) 
The normalization coefficient in (4.24) is 
1 

= 4.2 

Ci cos(5,) (4.29) 


When solving the radial wave function (4.21), the solution is obtained which is well behaved 

at the origin. This solution is followed outward in r until the region is reached where V(r) ~ 0 

and the centrifugal barrier 47/(1 + 1)/(2mr’) is small. Then the solution has the form (4.24) 

with C, = 1/cos(6,). The wave function is now properly normalized at large r, and by 

following it back toward the origin it is normalized everywhere. These steps provide the 

proper solution to the scattering equation (4.5) which was the starting point in the calculation. 
The reaction matrix is defined as the quantity 


Rux = [Pree VN (0) (4.30) 


It may be expanded in angular momentum states by using the expansions for the wave 
function and the plane wave. The angular integrals give 


Ryp = 4T 221 + DP k KR AK, k) (4.31) 
l 
where the radial integral is 
Rk’, k) = l r drj)(k’'r)V (r)R,(kr) (4.32) 
0 


It is defined for the general case where k’ + k. Of course, if they happen to be equal, then the 
answer is just (4.27): 

tan(5;) fh” tan(,) 
An? p(k) 2mk 
where the density of states is p(k). The reaction matrix is only proportional to tan(6) for the 
diagonal terms where k’ = k. Otherwise, one must do the integral in (4.32). 


R(k, k) = — (4.33) 
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The reaction matrix obeys an integral equation which is deduced by putting (4.30) into 
the integral equation (4.5) for W,(r) 


Vin R 
Ry = Vier +P (4.34) 


This integral equation is frequently found in scattering problems. Now it is solved easily. For 
each angular momentum state, the one-particle radial equation (4.21) is solved with the 
boundary condition that the asymptotic limit goes to (4.24). This wave function is used in 
(4.32), which determines the answer. It is a simple procedure to solve (4.34). The important 
point is that the wave functions must be normalized correctly. The designation of principal 
parts on the energy denominator in (4.34) specifies the unique way this normalization is to be 
done. 


4.1.2. T Matrix 


The other common choice of boundary conditions uses outgoing waves, which is 
accomplished by adding an infinitesimal complex part ið to the energy denominator in (4.5). 
The integral equation is 


W(t) = Ql) + |rok, r — PV ye) (4.35) 


where the Green’s function for outgoing waves is 


BK e` -(r—r’) 


G)(k,r—-—r -| 4.36 
ol ) (Qn)? E — Ep + ið (4.36) 
This integral is evaluated to give 
eR Tp(k) 
— — L = i sin(kR 4.37 
Go(k, R) p(k) TR ER [cos(kKR) + isin(kR)] (4.37) 
The cosine term was evaluated earlier in (4.17). The sine term is 
sin(k|r — r’ A AN (EA: 
TEED L Sent DPE Pye) (4.38) 
— ] 


These results can be put into the integral equation (4.35) to give an equation for the wave 
function. Averaging over angles gives for each angular momentum state: 


R,(kr) = ji(kr) + 41° p(k) | r? dr ilke lkr) — HY VORA’) (4.39) 


The radial part of the wave function is denoted by R. It will be a different wave function than 
found for the standing-wave boundary conditions. The difference between the two is only a 
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different choice of normalization coefficient C,(k) in (4.24). This different choice is what is to 
be determined. The above integral equation may be rewritten as 


Rir) = jh) + An? p(k) | dnika’) 


—4n*ip(k) M r° dr jy ke’ WV (r)Ri(kr | 
0 


+ 4n2p(k)n,(kr) | dv jkr! VNR) (4.40) 


Then one can take the limit kr — oo and obtain 
jim Rir) > flr) — iD] + Dim kr) (4.41) 


Dk) = 4r? p(k) | Pdr (kr) V (DRC) (4.42) 


This equation has to have the form (4.24) because the radial wave function R (kr) also obeys 
the differential equation (4.21). The result (4.41) has that form if 


D i(k) = —e® sin(5,) (4.43) 
so that the factor multiplying 7,(Ar) is 
1 —iD(k) = 1+ ie sin(5,)) = 1 + a (e — 7 #1) 
=e"! cos(ò;) (4.44) 
For the asymptotic limit 


am, R,(kr) > e®'[ j,(kr) cos(8;) — n,(kr) sin(8;)] 


id 


— S [sin(kr — 11/2) cos(8,) + cos(kr — In/2) sin(8,)] 
ið; 
= — sin(kr + 8, — In/2) (4.45) 


which is indeed the right form, (4.24). For outgoing wave boundary conditions, the proper 
choice of normalization coefficient is 


C (k) = 8A (4.46) 


One should proceed by solving (4.21) and insist that it have this asymptotic limit. The easiest 
way is to first solve it for a real wave function and make this real wave function go to 
[sin(kr + ô — In/2)]/kr [which determines 6,(k)]. Then one can multiply the wave function 
everywhere by the phase factor exp[i6,(k)]. 
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The 7 matrix is defined as 
Tex = | PrO 


= 4n F` (21 + 1)P (k KT (KE (4.47) 
l 


T(k’, k) = | r?drj(k'r)V(r)Ři(kr) (4.48) 


The original integral equation for the wave function (4.35) may be used to generate the T- 
matrix equation 
k'k Ikik 


Tek = Wr tP} 


kki kik 4.49 
k; Ek — Ek, + id ( ) 


This equation is often encountered in scattering problems. The solution is easy. One solves 
Schrédinger’s equation (4.1) for the wave function ,(r). Only the radial part of the wave 
function is difficult, since one must solve (4.21). The solution is normalized by insisting that 
the radial wave function have the form (4.24) at long distance from the potential. The 
coefficient is C, = exp[i6,(k)]. These radial solutions are used in (4.47), which then gives an 
exact solution for the 7matrix equation. This result differs from the reaction matrix result 
only in the choice of the coefficient C, which multiplies the radial wave function. The angular 
momentum components of the two scattering functions are related by the ratio of these 
normalization coefficients: 


R(kr) = e® cos(5))R,(kr) (4.50) 
T,(k’, k) = e® cos(8)R(K’, k) (4.51) 


The T matrix is a complex quantity since it was defined with a complex exp[id,] phase factor. 
The diagonal T matrix is 


, 1 , 
T,(k, k) ei sin[S,(k)] = -z sin[ð;] (4.52) 


~ An? p(k) 


We repeat again that this simple form is valid only when k = k’. The imaginary part of the 
diagonal T matrix is 


2 
sin’ (6,) 
— — 4.53 
l 9 
—Im[7,,] = ——~ ’ (21 + 1) sin*[6,(-)] (4.54) 
which is related to the total scattering cross section for the potential: 
An 
C= Le! + 1) sin’[8,(4)] (4.55) 
One can also prove the optical theorem 
dk > 
—2 Im[Tyx| = 21 3 [Tkl ÒlEr — Ep) (4.56) 
(27) 
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It is important to realize that the reaction matrix is not identical to the real part of the T 
matrix. Either of these two different quantities may be selected as the energy shift of a particle 
interacting with a potential. 


4.1.3. Friedel’s Theorem 


There are several important theorems involving phase shifts. These theorems are 
conveniently proved by letting the solid have a spherical shape of radius R, with the impurity 
located at the center of the sphere. The distance R 1s very large, and eventually the limit is 
taken of R — ov. A finite size sample is used since we are going to count nodes in the wave 
function. Start counting at the surface of the sphere and move inward. It is assumed, for 
convenience, that the wave functions vanish at the surface of the sphere. 

In the absence of the potential, the solutions which are regular at the origin have a radial 
part 7,(kr). The condition that they vanish at the surface of the sphere is 


I(k,R) = 0 (4.57) 


Since small values of / are important, say / < 5 for most applications, and since R is very 
large, an adequate approximation at large argument is 


sin (in — 4 (4.58) 


Ji (k,R) —> 5 


l 
kR 


The condition for the vanishing of the wave function at the surface is 
l 


For each value of /, there is a solution for each additional integer n, where each additional 
integer has a solution with an extra node in the wave function. Similarly, the continuum wave 
functions (i.e., for €, > 0) in the presence of the impurity at large r is 


Cl. It 
R,(kr) > T sin(x +ô — 4 (4.60) 


The solution is obtained for each k, which satisfies the equation 
l 
k ,R + 0/(k,) = (r + s) (4.61) 


Count the additional particle states in the presence of the impurity. The number of states dn 
between k and k + dk is 
dn R ld Ô; 


dk v m dk 


The first term R/7 is just what one would have without the impurity. The extra states ôn from 
the impurity are given by the formula 


(4.62) 


dön _ 1d, 
dk ndk 
The quantity on the right (d/dk)(0,(k)/1) is interpreted as the change in the number of 


particle states caused by the impurity. For example, if the potential is repulsive so that 
particles are pushed away from the impurity region, then d6,/dk will be negative. Similarly, 


(4.63) 
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an attractive potential will draw particles inward, so that dò;/dk is positive. Many potentials 
have d6,/dk positive for some wave vectors and negative for others. The quantity d6,/dk is 
the change in the density of states for each angular momentum state, each magnetic quantum 
number m, and each spin quantum number m,. The total change in the density of states is 
obtained by summing over all these quantum numbers: 


d d Òm m 
LN =— m , 
dk dk mm ( T ) (4.64) 


In a metal, the electron states are occupied up to the Fermi wave vector kp. The Friedel sum 
rule (Friedel, 1952, 1953) is obtained by integrating up to the Fermi wave vector, 


fr dN Stm,m, (kr) 
z- fan) = z (mast 4.5 


mml n 


where Z is the charge on the impurity. It is assumed that the phase shift is zero at k = 0. 
Levinson’s theorem is that the phase shift is zero at k = 0 unless there are bound states, in 
which case it is = times the number of bound states. The Friedel sum rule is a statement of 
charge neutrality. In a static electron gas, there are no long-range Coulomb potentials of the 
form r—'. Instead, the electron gas is displaced in the vicinity of an impurity charge. The 
displaced electronic charge exactly cancels the impurity charge. For example, an impurity of 
positive valence Z has electrons drawn in, until an extra Z of electrons, or eZ of charge, 
surrounds the impurity. This extra charge is called a screening charge. It is shown in Fig. 
4.1(a). The phase shifts are calculated for the screened potential of impurity plus screening 
charge. If the potential is repulsive, electrons are depleted around the impurity, so that charge 
neutrality is maintained. This depletion is shown in Fig. 4.1(b). The Friedel sum rule is a 
statement of charge neutrality: the change in electron charge around an impurity is exactly 
equal in magnitude and opposite in sign to the charge of the impurity. A neutral impurity 
would have a Friedel sum of zero. The sum rule does not mean that all the phase shifts vanish. 
Generally, some are plus, some are minus, and all vary with the wave vector. Only the 
summation at the Fermi energy is zero. Another way to express that the screening charge 
equals the impurity charge is 


OO 
Z= an| rdr[p(r) — po] (4.66) 
0 

where pọ is the equilibrium charge density in the metal. 

The charge densities in Fig. 4.1 are schematic. The method of actually calculating these 
curves is discussed in Chapter 5. All such curves, when calculated correctly, have the 
oscillations in the charge density at large distance. These variations are called Friedel 
oscillations. Their magnitude has been exaggerated in Fig. 4.1, since they are small in 
amplitude. But they occur in real solids and have been observed by several techniques. The 


(a) Z >O 


pir) plr) (b)Z <O 


r r 
FIGURE 4.1 Electron density p(r) when an impurity charge of Z is put at the origin. (a) For Z > 0, (b) for Z < 0. 
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Friedel oscillations may be derived by taking the asymptotic limit of the change in charge 
density, 


p(r) — si)? — 16,17] (4.67) 


‘aes 


Jim [p(7) — Pol = 2 QI + Di dkisin” (kr + 8, — In/2) — sin? (kr — In/2)] (4.68) 


(27 ae 2 
and 
[sin (kr + 8, — In/2) — sin’ (kr — In /2)] 

= $ [cos(2kr — Im) — cos(2kr + 28; — In)] (4.69) 


The wave vector integral is difficult because the phase shifts 5,(k) depend on k. An 
approximate answer is derived by writing this dependence as 06,(k) = 6,(k-) + 
(k — k)(dd/dk). The k integral is then elementary and the r-dependent part is 


(—1)' N dk cos(2r — cos| 2e( + =) + 26, — 2kp = 


I 
= — CD {sin(2kpr) — sin[2kpr + 285,(k-)]} (4.70) 


and the change in density is 


tim [p(r) — Pol = — Yel +1) sin[8(kp)] x cos[2ker + ò;(kr)] + (5) (4.71) 


At large distances, the changes in charge density oscillate with a period of 2k, and decrease in 
amplitude as r-*. This asymptotic equation is independent of the nature of the impurity. The 
impurity determines only the values for the phase shift 5,(x;). 

The Friedel sum rule is believed to be exact in real systems. Langer and Ambegaokar 
(1961) have shown it to be valid even in an interacting many-particle system. If one knows the 
exact impurity potential and the exact screening charge profile, the Friedel sum rule will be 
valid. In practice, a calculation of an impurity potential is done by pseudopotential or other 
means, and it is screened by a good dielectric function. The phase shifts are evaluated 
numerically and it is found that the Friedel sum rule errs by a small percentage. This error is 
no fault of the rule but the choice of potential or screening function. In fact, the theoretical 
system is not charge neutral. Usually the potential is adjusted slightly, for example, by altering 
a screening length, to force the Friedel sum rule to be satisfied. Then one has a consistent 
model of the neutral system, and the phase shifts are probably reasonably accurate. 

Quite often the phase shifts do not depend on m; or m, but they strongly depend on /. 
Then one can represent the answer for electrons as 


Z= pa + 1)8;(kr) (4.72) 


which is the way that it is usually presented. The factor of 2 is spin degeneracy and (2/ + 1) is 
orbital degeneracy. The quantity (2/7)(2/ + 1)6,(A;) is interpreted as the amount of screening 
charge in the angular momentum channel /. 

In real solids, the impurity may occupy the site ordinarily occ upied by a host atom. Then 
the normal host atom would have its own set of phase shifts 6” which characterize the 
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normal metal. For example, if Mg** is a substitutional impurity in sodium metal, it could be 
at a site where a Na* is usually placed. The Na* ion has its own phase shifts 6, and the 
Mg? ion has its own phase shifts 6). If one repeats the arguments leading to (4.72), the 
Friedel sum rule is given by the difference of the two sets of phase shifts: 


Z == E2 DBP Ee) — 8) (kp) (4.73) 


Note that the host phase shifts do not obey the Friedel sum rule. 

Another exact result may be obtained in terms of phase shifts. It is the total energy of the 
impurity as caused by its interactions with the surrounding electrons in a metal. The theorem 
is due to Fumi (1955) and relates the total energy E; to an energy integral over the phase 
shifts: 


EF 
0 Imm, n 
2 pkr 
= -=| rak y OM (4.75) 
M Jo Imm, 


The energy integral starts at the bottom of the conduction band. It is assumed there are no 
bound states. The presence of bound states will change the answer. One can also use the 
second form of (4.75), which includes just the sum over wave vector states for the conduction 
electrons. 

Fumi’s theorem is proved in the following way. The Hamiltonian is the kinetic energy 
plus the potential energy. Well outside the impurity, or its screening charge, the potential is 
zero, and the Hamiltonian only has the kinetic energy term. Here the energy must just be the 
summation of the kinetic energies of all the particles, which is a discrete summation over the 
states allowed in the sphere of radius R: 


hk? h? 2 


The desired quantity is actually the change in kinetic energy, so the result without the 
impurity is subtracted: 


h2 
=e „a,l +n — 8) — [n +4 Dr} (4.77) 
h2 
x — mR nm, k,,5)(K,) (4.78) 


Next let the radius of the sphere go to infinity, which changes the wave vector summation to a 
continuous integration. This change and the result are 


k 


Rn T Jo 
h2 kp 
B=- nt | kdk®,(k) (4.80) 
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which proves the theorem for the conduction band states. The result can be extended to 
include bound states. They rarely occur in three dimensions, but are common in lower 
dimensions. 


4.1.4. Impurity Scattering 


How are Green’s functions used to describe impurity scattering? They do not work very 
well for describing the scattering of a particle from a single impurity. In Dyson’s equation, the 
change in energy of the particle is calculated from the interactions. One impurity in a large 
system of volume v changes the energy of a particle only by order 1/v. This result assumes, of 
course, that the particle does not get bound by the impurity. A free-particle state has a density 
Ibp]? = 1/v, so that a local impurity changes the energy only by 1/v. To alter the energy, one 
must add N; impurities, where N;/v = n; as v —> oo. Then the particle energy gets changed as 
a function of the concentration n; of impurities. The isolated impurity case should be studied 
by the following procedure: one solves the self-energy as a function of n; and then takes the 
limit n; > 0. Terms of order O(n;) are from an electron interaction with single impurities. 

The potential which scatters the electron is taken as a summation of impurity potentials: 


V(r) = V(r —R;)) (4.81) 
j 
The Fourier transform is V (q): 
V(a) = p,(a)V (a)p(—a) (4.82) 
where p(—q) and p,(q) are the particle densities of electrons and impurities, respectively: 
PQ) = © Crrq.o Ce (4.83) 
pd) = 2 ea’ (4.84) 


The expansion of the S matrix encounters products of the impurity density operator: 


Fi(G1, q2, >» +> Gy) = (P:(q1)P:(42) -+ P;(4,,)) (4.85) 


These products must be averaged. This average is not taken over temperature. Instead, it is 
averaged over the possible positions which the impurities may have in the solid. Usually it is 
assumed that the impurities are randomly located and that there is no correlation between their 
positions. This method of impurity averaging was suggested by Kohn and Luttinger (1957). 
Their result will be derived by first examining the results for a small number of operators. The 
first term is n = 1: 


fq) = (x on) = Ngo (4.86) 
J 


If the R; are located randomly, this sum is zero unless q = 0, and then the sum gives the 
number of impurities N;. The second case is n = 2: 


(res ae | = (x eiwvern + eft Rtn | (4.87) 


i i=j iA] 
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The first term is zero unless q, + q, = 0, since it is the same average as in f}. The second 
term is zero unless both q, and q, equal zero: 


hadı; q2) = Nj9q,+4,=0 + NAN; — 1)3q,=059,=0 (4.88) 
The general result found by Kohn and Luttinger is 
Jaldi: d2; - - -> Gn) = NiOz,q-0 + NN; — 1) 2 Ox", q)=083 


+ NAN; — 1N; — 2)8y,_,55,_,825- +: (4.89) 


q=0 


j=m+14j= =0 


In terms with products of several delta functions Oy one must take all possible combi- 
nations of distributing the q; among the delta functions. The final result 1s obtained by 
generalizing the low-order examples above. The result for n = 3 is 


Hdi; q2, 93) = N; Òq, +q+q; + N;(N; — 1)(Sq, %q, +4; + Sq, 9q, +a; +09 q Ôq +a) 
+ NN; — 1)(N; — 2)8q, Sq, 5q, (4.90) 


The next step is to learn to draw Feynman diagrams with these impurity averages. Since 
N; > 10!° for real systems, this expression is approximated by 

Saldi ++ In) = Nidz, +N; Sy, 35, +N; Sy, Sz 55, Ho (4.91) 
There occurs the combinations of the function 

NOs, (4.92) 

This factor is interpreted as the particle scattering from a single impurity. The factor of ds, 
states that momentum is conserved for the particle while scattering from the impurity. In Fig. 
4,2(a), the impurity is represented by an x, and the solid line is the particle. The dashed lines 
represent impurity interactions V(q,). Momentum conservation requires the momenta from a 
single impurity to sum to zero. Diagrams with two factors of (4.92) involve the scattering 
from two impurities, as shown in Figs 4.2(b) and 4.2(c). 

In Fig. 4.2(b), the scattering from the two impurities interfere—the momentum lines 
cross each other. Such interference occurs when the impurities are nearby, and such diagrams 
are important when the concentration of impurities is large. In Fig. 4.2(c), the two scattering 
events are disconnected, since the interaction lines do not overlap. Each connected diagram 
gives a separate self-energy contribution. Edwards (1958) has shown that one has the Dyson 
equation 


1 
— Ép — X(p, ipp) 


where the self-energy X(p, ip,) from impurity scattering contains all the connected contri- 
butions. The diagram in Fig. 4.2(b) is connected, so it contributes a term to X. The self-energy 


Gp, ip,) = (4.93) 
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FIGURE 4.2 
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FIGURE 4.3 


diagrams for scattering from a single impurity are shown in Fig. 4.3. They are the terms which 
are important when the concentration of impurities is small. The first self-energy diagram in 
Fig. 4.3 gives a contribution 


ZO, ip) =“ E Sg = VO) (4.94) 
q 


The second diagram in the series has a particle line as an intermediate state. This self-energy 
term is 


, N; , 
IOP, ip) =F DO Syta VDV (a) + ah, ipa) (4.95) 
qı 92 


which may also be written as (v > oo) 


dp’ Vip—p')V(p' — p) 
(2n)° ID, — Ép 


There is no summation over internal frequency variables. The impurity is considered rigid and 
cannot absorb or transfer energy to the particle which is scattering. Consequently, all the 
internal lines have the same energy ip, as the initial particle. This result is not assumed but 
instead is derived directly from the Green’s function expansion. For example, the above self- 
energy term arose from second-order terms and had the t integrals 


52(p, ip,) = n; | (4.96) 


Bo o B æ 
| ares | dey | ar80,- 1080P + ar ti = 12O, t) 
0 0 0 


= Gp, ip, 9 (p + qi, ipn) (4.97) 


The fact that all electron lines have the same energy comes directly from the t integrals. It is 
true for an arbitrary order of diagram—all particle Green’s functions have the same energy 
ip,. As stated before, there is no other excitation in the model to which energy may be 
transferred. 

The third term in the series in Fig. 4.3 is 


d'p, dp, V(p— p VP: — PV — p) 
(27) (27) (ip, _ Sp, Pn E Ép) 


At this point, one may deduce the nth term by inspection. It is 


EO(p, ip) =n, l 


d’ d? 
EO, ip) =m, | OPS EVO -pV -P)O 


x TH GOW; ipn) (4.98) 


Now that the nth term is evident, it is also possible to sum the series of terms and get the total 
self-energy which is proportional to the concentration n;. The first step in this summation is to 
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write an integral equation for the self-energy. This integral equation is in terms of a vertex 
function I(p’, p) which has the equation 


/ / dp, V(p’ 7 p(y, p) 
F(p’, p) = Vp — = sree 4.99 
(P. p) =V p)+ | Qn) P-E (4.99) 
_ (n) d? Pı VP — p DI P. p) 
Es ip) = È EO, p) =ar O + [EE PP P 
= n;l (p, p) (4.100) 


Repeated iteration of Eq. (4.99) will generate the series of terms which give the successive 
diagrams in Fig. 4.3, of which (4.98) is the nth term. The series is summed by solving the 
integral equation. To this end, define the function 


d'p, e™: "T (p}, p) 


- (4.101) 
(27) iP, — Ey 


n(r. p) = | 


This quantity and I'(p’, p) are both functions of ip,, but this dependence is not explicitly 
added to the notation. In terms of this new function, the self-energy (4.100) is 


X(p, ipa) = n,V(0) +n; | Bre” "V(a\nt p) (4.102) 


To solve for the function n(r, p), consider the effects of the differential operator acting on it: 


h2 d n 
50? -n-p fem) = [ER aop Pr i ‘T(p’, p) 


3 


h? > , _ Ch wr 
-£v -u-i eD = - fE PT (p’, p) (4.103) 


The right-hand side of this equation may be simplified. First, note that the vertex function may 
be written as 


F(p’, p) = [dre "Vere" + (r, p)] (4.104) 
which follows directly from its definition (4.99). From this relation, it is easy to show that 
| r Tp’, pe” "= Ve" + x0, p)] (4.105) 

It is deduced that the function obeys an inhomogeneous differential equation: 
|- Fy —w-—ip, + vo) nlr, p) = —V (rje? `" (4.106) 


This equation is solved in the following fashion. First find the solution to the following 
Schrödinger equation: 


2 
mh =| -EV rO Walt) = Eh (4.107) 


This differential equation was discussed extensively at the beginning of this section. Take the 
outgoing wave boundary conditions. The reason for this choice will be clear below. In terms 
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of these wave functions, the special function n(r, p) has the following solution to the inho- 
mogeneous equation (4.106): 


Wa, (r) T. Ap 


Tr, p) = [erer Vir)» Hm 2 ip, + u — E, 


- 4.108 
z Wy +H Ey i ) 
where the T matrix is defined in (4.47). This result for n(r, p) can be directly verified by 


operating on both sides by [H — u — ip„] and then using the completeness property: 
LV) = rr) (4.109) 


The summation À is taken over all the eigenstates of H, which include the bound states as well 
as the continuum states. When this result is used in (4.102), the final result is obtained for the 
self-energy in terms of the T matrices which were defined in (4.47): 


[Trpl? 
X(p, ip,,) = nV (0) + n; $ ——*—_— 
PP bp, tu- E 


(4.110) 
Equation (4.110) is the exact term in the self-energy which is proportional to the concen- 
tration n;. It has been obtained by summing the set of diagrams shown in Fig. 4.3, where the 
particle has multiple scattering events from the same impurity. There are other terms in the 
self-energy which are proportional to higher powers of the concentration (n;)”. For example, 
one term which contributes to order n? is shown in Fig. 4.2(b). However, in the limit of dilute 
impurities, as n; — 0, the result (4.110) is the most important. 

Our interest is usually in the retarded self-energy, which is called X(p, œ), omitting the 
“ret” subscript: 


[Tpl 
Z(p, 0) = n,V(0) +7, 


O E TB (4.111) 


There is one particular case which is very important: when the energy œ is set equal to &,. 
Then the self-energy (4.111) is just n; times the diagonal T matrix: 


E(P.&) = nTa = — ED AI + UO sinap) (4.112) 


This identification is made by noting that the 7-matrix equation (4.49) is identical to the vertex 
equation (4.99) when ip, — §, + id. This result is frequently used in representing the self- 
energy of the particle. The imaginary part of the self-energy has a simple formula: 


—2 Im[X(p, o)] = 27n; 3 [Typ Slo — E,) (4.113) 


The right-hand side may be further reduced in the special case œ = €,.. If the two prior results 
(4.55) and (4.56) are combined, the result may be shown to be proportional to the scattering 
cross section: 


—2 Im[2(p, §)] = —2n; Im{T,,} = nv o(p) (4.114) 
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The imaginary part of the self-energy for œ = &, is the cross section times v,n,;. The factor 
v,n; is just the rate at which the particle encounters the impurities. The lifetime and mean- 


free-path of the particle may be identified as 


1 
y= MPO) (4.115) 


1 
ly = nop) = U,t(P) (4.1 16) 


These results provide an example of the earlier assertion that the imaginary part of the 
retarded self-energy is related to the damping of the particle. 

An interesting question is the relationship between these self-energy expressions and 
Fumi’s theorem (4.75). The two results are quite compatible. The self-energy expressions 
describe the effect of the impurities on single-particle states. Fumi’s theorem gives the energy 
change of the system. Fumi’s theorem can be derived by averaging the single-particle 
properties. This derivation is shown in the next section. 


4.1.5. Ground State Energy 


The ground state energy is derived starting from the self-energy expression (4.111). 
When averaged correctly, this expression gives Fumi’s theorem (4.75) for the energy per 
impurity. What is proved, say at zero temperature, is that the impurities cause a change in the 
ground state energy of 


kr 

AQ = —N, =| pdp X (21 + 1)8)(p) + O(N?) (4.117) 
0 i 

Terms proportional to O(N?) give energy terms arising from interactions between impurities. 

The term proportional to a single power N, is the average energy per impurity from inter- 

actions with the electrons. 

The result (4.117) has an interesting history. Most of our references have been to solid 
state physics work such as that of Friedel and Fumi. A parallel development was occurring in 
nuclear theory. Brueckner et al. (1954) proposed that the ground state energy of the system 
was an average over the energy of the single-particle states. The energy per particle was taken 
to be the reaction matrix, so they took the energy change to be 


3 kr 
AQ = 2N, Er ER = v= | pdp (21 + 1) tan[8y(p) 


This equation has tan[6,| instead of 6, and so is incorrect. Fukuda and Newton (1956) showed 
that the correct result should be 6, rather than tan[6,]. In fact, they give results identical to 
those of Fumi. Fukuda and Newton, and also DeWitt (1956), proved some important theo- 
rems regarding the energy (4.117). The result is not dependent on the spherical box which 
was used in the derivation; the same result was demonstrated for a cube and other shapes. 
Similarly, it does not depend on the assumption that the wave function vanished at the surface 
of the box. The same result is obtained for other boundary conditions. Fumi’s theorem appears 
to be a general result. 


Sec. 4.1 o Potential Scattering 205 


The ground state energy is calculated using the formula (3.300) derived earlier, which 
contains a coupling constant integration 


d'p f dn 
= E = = pfi T Gip, ip, EM, ip, n) (4.118) 


Since only the term proportional to n; is needed, and since the self-energy ÈX is already 
proportional to n;, then replace G by G. The difference is just self-energy terms which give 
higher powers in (n;)", which are going to be neglected. The coupling constant integration is 
important for weighting each factor V of impurity potential (and not each factor of the T 
matrix). From (4.98), this self-energy is 


_ y. dp Yop Vpop Vo'p 
X(p, ip, =n] nm +7 EE, Qn) pE, 
3 l d°p' H p” Vop’ Vy 'p” Vp'p 
(2n)° J (2n)° (ip — §& ip — E) (4.119) 
— dp 1 1 dp Vow pp Vyp | 
a= avi | a PBS on ee P-E 


1 l dp | dp" V op’ Vy 'p” Vyp + 
(2n)° J 2r (ip — §& ip — &p) 

where the coupling constant integration has been done. There are several ways to evaluate this 

expression. 

The first derivation will give tan(6,), instead of 5,, so that it gives the wrong answer. This 
derivation is included, not just for fun, but to show how simple it can be to get the wrong 
result. While reading this derivation, try to find the incorrect step, which leads to the wrong 
result. The incorrect derivation proceeds by doing the summation on Matsubara frequencies 
in (4.119) term by term: 


l , 
> 9 (p,ip) = ne (Ép) (4.120) 
ip 
i n — Nn ! 
=> GO p, ip) Gp’, ip) = "nr (Gp) — Re (Sp) (4.121) 
B ip Ép ~ Ép 
The nth term is found from the contour integral, 
dz 1 
ac z127 = 4.12 
+ ne) z-Ę 0 (4.122) 


where the contour is taken at infinity so that it includes all the poles. The contour integral 
leads to the identity 


R Pu 120p; ip) = > nr), iG”; 6) (4.123) 
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The expression for the ground state energy becomes 


aoon [os et oN + A eal 


Veo nr (E,)9 OW’, Ep) + nr (E)GO (EI) + + | (4.124) 


There are two terms proportional to V*. They are identical, after changing around the dummy 
variables of integration, so this cancels the 5 factor. Similarly, there are three terms with three 
powers of V which are all equal, which cancels the } factor. The 1/n factor is canceled in the 
nth term: 


i dp d°p' Vey 
AQ = 2N, i In) znr (É Dret ar es Sp — Sp’ 


dp (dp Vow op Yop 4.125 
+] (On): | (any? E- Ey Ey — Ey) (4.125) 


The series in brackets is just the reaction matrix equation (4.34), so that 


P 2N; {*" 
AQ = 2N, | dtnp(E,)Rpp =- | pdp Ze + 1) tan[5,(p)] (4.126) 


The derivation gives tan ô, which is the wrong answer. Did you find the wrong step? The 
reaction matrix equation (4.126) contains a P for principal parts. It is an instruction to omit 
the term where the energy denominator vanishes. The T-matrix equation does not have this 
instruction, because one is supposed to include such terms. The derivation mishandled these 
terms. The “theorem” (4.123) is correct only as long as the &, are different; it is incorrect if 
any are alike. The correct result, valid for all cases, is obtained by evaluating the contour 
integral (4.122) by drawing a branch cut along the real axis. The summation over Matsubara 
frequencies is the difference between the integration along the top and bottom of the branch 
cut (see Sec. 3.5). When taking the imaginary part of the right-hand side, it is possible to get 
products of odd numbers of delta functions. For n = 3 the result is 


l 1 


BY Wp — Ep — aE) 
_ nr(§1) 4 nr(§2) 4 nr(§3) 
(Si — 6261 — §3) (62 — 61X62 — &3) (83 — Sa) — &1) 
- wn (E)5(E 1 — &))0(S; — §3) (4.127) 


The last term contains the product of delta functions. This term was omitted in the first 
derivation. Similar correction terms, with odd powers of delta functions, occur in the fourth 
and all higher terms in the perturbation series (4.124). An examination of these terms shows 
that they are generating a series of reaction matrices: 


3 
AQ = 2, | 22. 
1 ( 3 


dp d 
27) ne) Roy -5 | Pid p? —- e Rpp, Rpp, Rpp 6p — 6p OE, — bp) + °° | 


(2n)° 
(4.128) 
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The wave vector integrals may be done term by term. The d°p, integral eliminates the delta 
function 0(§, — ¢,, ) and makes p,, = p. The angular integrations force all angular momentum 
I components to be the same. These steps derive the series 


2N, [© 
AQ =| pdp (21+ 1 
i| Pee yX +1) 


x {[[4n" pR (p, p)] — 4 [407 pR (p, p? +4 [4x7 pR p, p — e} (4.129) 


The factor 4n*pR/(p, p) = —tan[5,(p)]. The series is just that for the tan™!(x)= 
x—x/3+x/5... where x = — tan(ò). The ground state energy per particle as given by 
Fumi’s theorem is: 


kp 
AQ = —2Ni | pdp S21 + 1)8,(p) (4.130) 
TMM Jo ] 


This derivation contains two messages. The first is that Fumi’s theorem with 6,(p) is correct 
and may be obtained by using the conventional formulas for the ground state energy of the 
system. The second message is that the energy of the system is not just a simple average of 
the single-particle energies, such as Z nr (Ep)Rpp OF Upnr(S,)T pp. Both of these guesses are 
incorrect. 

A more elegant derivation of Fumi’s theorem is possible by following the techniques of 
Langer and Ambegaokar (1961). They avoided evaluating the series (4.119) on a term-by- 
term basis. Instead, they used operator manipulations to attain the above result. 


4.2. LOCALIZED STATE IN THE CONTINUUM 


In this section the following Hamiltonian is solved exactly: 


H =e,b'b+ Ylexcic, + Ay(clb + bte,)] (4.131) 
k 


It describes a localized state of fixed energy £, and operators b and bt. The localized state will 
be called the impurity, and it is assumed only one exists. There is a continuous set of states of 
energy &, with operators c, and ae This set of states could have a finite bandwidth, as often 
occurs in tight-binding models in solids, or else it could be a free-particle model. The last 
term in the Hamiltonian includes the mixing between these two kinds of states. It contains 
processes whereby the continuum particle hops onto the impurity (btc) and where the 
particle hops off the impurity into the continuum (cib). If the particles have spin, it is 
assumed that the hopping on and off the impurity preserves the particle spin state. The spin 
never changes and is unimportant, so its dependence is suppressed in all subscripts and labels. 
If the hopping particle could change its spin orientation, the problem would become harder. 

Since the Hamiltonian is quadratic in operators, its solution is equivalent to diag- 
onalizing a matrix. The solution may be obtained in this fashion, but here it is done analy- 
tically. Since the Hamiltonian is quadratic, the statistics are irrelevant since the same 
eigenvalues are obtained for fermions or bosons. It is also irrelevant whether there are one or 
many particles in the system. For a Hamiltonian which contains only quadratic operators of 
fermions or bosons, one just diagonalizes the Hamiltonian to find the eigenstates, and then all 
eigenstates behave independently. 
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This model Hamiltonian was introduced simultaneously by Anderson (1961) and Fano 
(1961). Anderson applied it to solid-state physics, while Fano used it in atomic spectra. It 
tends to be called the Anderson model or the Fano model depending on whether the speaker is 
a solid state or atomic physicist. Here it is called the Fano—Anderson model. As explained in 
Chapter 1, it is related to the famous Anderson model, which is (4.131) plus another term. 

The Hamiltonian will first be solved without using Green’s functions. Afterwards, the 
Green’s function solution will be given. The nature of the solution depends critically on 
whether the energy £, is within the band of states ¢,. If the continuous band of states €ẹ 1s 
confined to the range 


W] < Ek < Wa (4.132) 


then the solution depends on whether £, is also within this range. Actually, this statement is 
incorrect. Because of the interactions with the continuous band of states, the energy of the 
localized state is altered to a new value €.. Of course, since the Hamiltonian is not yet solved, 
the renormalized energy €, is not yet known. Jumping ahead, the final result is that the new 
energy is 


(4.133) 


If €. is within the range 


then the solution has an important property: there are no localized states in the system. A 
continuum particle may hop onto the impurity, but after a while it may hop off again. Particles 
spend only part of their time on the impurity, so it is not a well-defined eigenstate. The 
impurity state has become a scattering resonance. 

Of course, if the new energy &, is outside of the band of continuum states, then a true 
localized state will exist. Then the solution has a distinct form—a real bound state. 

First solve for the case where €, is within the continuous band and no bound state exists. 
All states are continuum states. A new set of operators a, and a are defined which refer to 
the eigenstates of (4.131). The old operators b and c, can be expanded in terms of the new 
set: 


b= XO Vkæk 
k 


Ck = $ Nk e Me (4.135) 
k’ 


One impurity in the presence of N ~ 10% particles changes their energy by a negligible 
amount—remember the assumption of no bound states—so that these new operators still have 
the energy €x. All that is needed is the vector v, and the matrix n, w for a complete solution. 
The model is solved first in one dimension. The extension to two and three dimensions will be 
easy to describe at the end. It is also assumed that the dispersion €, does not permit two states 
k and k’ to have the same energy. Of course, this constraint is never realized in practice, since 
states of k and —k usually have the same energy. The problem is that the hopping on and off 
the impurity preserves energy but not wave vector information. The hopping will mix the 
states which exist at the same energy. It is a nuisance to describe, so it will be omitted for the 
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moment, and k becomes a scalar. One way around the problem in one dimension is to 
describe symmetric and anti-symmetric combinations of the states with +k: 


l 

Cis = lt -+ C 4] (4.136) 
l 

Cka = pi — c_,| (4.137) 


These two states have the same energy €}. Only the symmetric state is assumed to interact 
with the localized state, so that the interaction has the form `, Albi cy + ch b]. This 
interpretation of the Hamiltonian makes physical sense. The subscripts s will no longer be 
written. 

Since the new operators a, and H describe eigenstates, the Hamiltonian may be written 
as 


H = > EALO (4.138) 
k 


The commutator [b, H] is evaluated using both the new operators and the old: 
[b, H] = be, + X Apc, = > Ep Vp Op (4.139) 
k k 


Then the old operators are expressed as (4.135), and this equation is 


Ec > Vr Xp + AEN EK! = 2 ELVEK (4. 140) 
k kk’ 


Each coefficient of «, is independent; i.e., take the commutator of the above equation with al 
and get 


Vy(€z — €e) = DAW Nik (4.141) 


The same procedure is used with the commutator [c,, H], which gives 


[ck, H] = ec, + Apb = 3 Nk, e Ek Xk (4.142) 


or expressing b and c, as «,, gives another equation: 
Ng w (Ex — Ep) = —A,Vp (4.143) 


The last equation gives a result for ng p, when €, Æ €y, which means that k Æ k’, because of 
our assumptions. It provides no information about ng w for the case where k =k’. It is 
necessary to introduce another unknown function Z, which is proportional to the value of 
Ng w When k = k’: 


AgVy 


Nk, k = — + On Z VgAk (4.144) 


Ez — Ep 
The other factors 4v, are added to the last term for convenience. The energy denominator in 
the first term is taken as a principal part; the term is omitted when £, = £y. All energy 
denominators are assumed to be principal parts, and the conventional symbol P will be 
omitted. If the expression for n4 y is used in (4.141), the equation for v, becomes 


Veler — £e) = 2 Ay (- 
k' € 


Ayv 
k . + Bu Zavede) (4.145) 
k 


ki 
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The v, dependence factors out completely. This factoring leaves an equation in which the only 
unknown quantity is Z}, 


At 
e —& = Y — + 7,4 (4.146) 


so that Z, quantity is now determined. It also simplifies the notation to introduce the self- 
energy function 


A2, 
X(E,) = $ —*—_ (4.147) 
k Ek T Er 


so that Z, is written as 
1 
Z; = ge — E€; — X(€,)] (4.148) 


All quantities on the right-hand side of (4.148) are known, so that Z, is now known. To make 
further progress toward obtaining n; y and v}, more equations are needed. They are obtained 
from the commutation relations for the old operators, which must still be obeyed when they 
are expressed in terms of the a, operators:: 


[b,b] =1= 5v (4.149) 
k 

[Ch ch] = Opps = 3 Nk Ne, k” (4.150) 

[b, ct] = 0 = 2 Nee Ve (4.151) 


The last equation is used first. If the result (4.144) for N4 w is used, then (4.151) becomes 


A,Vp 
0 = > ve( kVk + Bu Zavidr ) (4.152) 
k Ex — Er 
or 
v2, 
0= A ( Zit +5% — ) (4.153) 
k Ey — Ey 


Since A, is not zero, the quantity in parentheses must vanish. This result will be used below. 
Next, take Eq. (4.150), insert the result for ng w, and find 


A v H A / i 
On = > ie ke + On Zp VA k Vi + Opp Ly VpAp 
k” NE — Ek Ep — Ep 
which equals 
Z y? — Z V2, v, 
Sie = Öp ZVA? + A,Ap -EE 4 ArAp Y ammm (4.154) 
E — Ep kek k" (Ep — E,)(Ep0 — Ep) 
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The last term must be rearranged. Since the energy denominators are principal parts, use 
Poincaré’s theorem: 


I 1 1 
P =P ( l _t ) 
Ey — Ex Egr — Ey E&E — Ey Eg” — Ez Eg” — Ey 
+ T 8(Epr — €,)5(Epr — Ep) (4.155) 


The delta functions of continuous energy must be changed back to Kronecker deltas, since we 
are using box normalization in a box of length L. This alteration is 


L 
Ò — 7 Ò / 4, 15 
(e; Ey) Inv, Ol (4.156) 
where v; is the velocity of the particle: 
OE; 
—— 4.1 
Vi = OE (4.157) 


The terms can be regrouped in (4.154): 


L 2 
Sip = Spy VA? z+ (2) 
kk kk’ Yk ll k 2v; 


A A / 2, 2 
4 (z +y —E ) — (zev + z) (4.158) 


Ek — Ey k" Ep — Ek k" Ex — Ep 


The last two quantities in parentheses vanish, since each is identical to (4.153). All the 
remaining terms are proportional to y, and so exist only when k = k’. These steps give the 


equation for vz: 
LN 


Of course, once v; is known, then ng y is obtained easily from (4.144). The result for v, may 
be reworked into a form in which the physics is more transparent. The earlier result (4.148) 
for Z, is used to rewrite (4.159) as 


Ay 


2 
vt Te, — e, — Le + (LA2/204)° 


(4.160) 


Furthermore, the self energy X(,) will be interpreted as the real part of the retarded self- 
energy: 


Az, 
E a(€) = X — 4.161 
ret(€) Dr Fr (4.161) 
A, 
Re[Z,e(€,)] = PX —*— = Xe) (4.162) 
k Ek T by 


Similarly, the imaginary part of the retarded self-energy is 
Im[E,e(£)] = -r X AL 8(e — £y) (4.163) 
k! 
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By using the previous identity (4.156) for the delta function, the imaginary part of the 
retarded self-energy becomes 


L L 
Im[2,(e)] = -r > AZ, (5) Sy = — 5, Ak (4.164) 
k vk Uk 


The expression on the right is recognized as the same factor which occurs in the denominator 
of (4.160) for vz. In fact, this quantity may be rewritten as 


vV- (7) Im[X(€;)] 
t L J fe, — £, — Re E(e})f + [Im E(e,)P° 


(m 1 
7 ( L Jmf- =E ot (4.166) 


so that v2 is proportional to the imaginary part of a retarded Green’s function. It was shown in 
Sec. 3.3 that the imaginary part of a retarded Green’s function is proportional to the spectral 
function: 


(4.165) 


] 
A(e) = —2 Im] — (4.167) 


v = ZO (4.168) 


The remaining equation (4.149) which must be satisfied may now be considered: 
l=yv (4.169) 
k 


Since v? is now known, it is readily substituted, and the integral is: 


— yk ye) = [28 


In the second equality the integration variable was altered to the energy of the particle. This 
integral is just the sum rule for spectral functions which was proved in (3.121). The sum rule 
applies to any spectral function and must apply to the one derived above. Equation (4.149) is 
satisfied. The transformation (4.135) to the «, has been achieved. The form derived for v, 
satisfies all the commutation relations (4.149)-(4.151) as well as the commutation relations 
with the original Hamiltonian. The problem has been solved exactly. 

Since the quantity 


Gle) = —— (4.171) 


co 2 ret (e) 


is identified as a retarded Green’s function, for which particle is it the Green’s function? It is 
the Green’s function of the localized particle, or what became of the localized particle after it 
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became delocalized. This assertion is somewhat evident by considering the definition of the 
localized retarded Green’s function for fermions: 


GA) = —i@(1)(b()b* (0) + DOLA) 
= -0OY Vy (otg(t)04h (0) + of (O)or, (#)) 


= -iOH E Ve = — OW) E Alee 
k k 


. ~~ d Ek —ié,t 
= -i00 | — A(é,)e (4.172) 
oo 21 
In the last step the summation is changed to an integral over ¢,. The last line is the identity for 
the retarded Green’s function assigned in Problem 6 of Chapter 3. 

The Fano—Anderson model is just a description of a localized scattering resonance. The 
continuum particles come to the impurity, spend some time in the resonant state, and then 
depart in another continuum state. This model is really no different from that of the prior 
section, on impurity scattering, if the impurity potential is made to have a resonance. The 
phase shifts for the present model are defined as 


tan[ 5 ( €)] — - Im[È,«(e)] 


rete 4.173 
—& Re Lret(€) l ) 


The phase shift is all that is needed to use the results of the prior section to describe the Fano- 
Anderson model. The resonant behavior comes from approximating -Im £ ~T as a 
constant, or at least as a slowly varying function of energy. Then the spectral function A(€) is 
Lorentzian. If the width I" is small, this function describes a resonance. 

As an example of a scattering potential, consider the potential V(r) shown in Fig. 4.4(a). 
The resonance behavior is chosen by first solving for the bound states of Fig. 4.4(b). For the 
latter case, bound states exist when 


tan(ka) = -$ (4.174) 
1/2 
x = (4 — e) (4.175) 


A bound state at half of the well depth is made by choosing ka = aa = 37/4. Here k? is the 
particle energy relative to the bottom of the well. The well shape of Fig. 4.4(a) will have a 
resonance at nearly the same energy. The finite extent of the repulsive part will allow the 
particle to leave and impart a width to the state and to the scattering resonance. The phase 
shift 1s shown in Fig. 4.4(c) for the potential of part (a). The steep rise in the phase shift, of 
about 7m, occurs at the value ka ~ 31/4. 

The change in phase shift by 7 across the resonance may be understood from the Friedel 
sum rule. As the eigenstates of the system are filled up, one particle will reside in the vicinity 
of the impurity and contribute unity to the Friedel sum over 6/n. 

The initial Hamiltonian was written as (4.131) in terms of eigenstates which were not 
orthogonal. All of our effort was merely an exercise in orthogonalization. This work could 
have been avoided by writing H in an orthogonal basis. The basis would have the particles 
with a scattering resonance from the impurity. The Hamiltonian describes one-particle 
behavior. 
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FIGURE 4.4 (a) A potential whose phase shifts can show resonance behavior. The resonance occurs at the bound 
state of the potential shown in (b). (c) The phase shift jumps in value by almost 1 radians at the wave vector of the 
resonance. 


The model is very important in a number of applications. The d-states of transition 
metals seem well described as scattering resonances of the sp electrons (see Anderson and 
McMillan, 1967). Another example is in surface physics. When atoms come from a gas and 
are absorbed on the surface of a metal, the conduction electrons of the metal may hop onto the 
atomic-like states of the absorbed atom. The charging effect on the absorbed atoms seems 
well described by the model (Schénhammer and Gunnarsson, 1977). It has also found 
numerous applications in atomic physics. 

The model will now be solved by using Green’s functions. This solution is much quicker 
than the prior method, since there is only one self-energy diagram. The Hamiltonian is written 
as Hy) + V, where V is the last term in (4.131) and Họ is the first two terms. The Green’s 
function for the localized state is: 


B 
Glip) = — | dre?" (T.b(t)b*(0)) (4.176) 


The first self-energy term comes from the n = 2 term in the S-matrix expansion: 


1P — pP B 
-l | ner | ax, | dr, Y Ap Ap, (b(t) 
0 0 0 ky ky 


x [en TB" (ty) + Bt et. Dle (ta)B" (ty) + Bladel (ETO) (4.177) 
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The correlation function is easily evaluated in terms of the unperturbed Green’s functions of 
the localized state Y(t) and the continuum states 4% (k, T) to give 


B — B B 
24i | dte?” | dt | dG (t — 1) GOK, t1 — T2) (T) 


= Y AZO (ipf G(R, ip) = G (ip X(ip) (4.178) 
k 
AZ 
Lip) = ARG (k, ip) = Z — (4.179) 
k k Ip — E 


X(ip) turns out to be the only self-energy diagram. The higher terms in the S matrix only 
produce higher powers of this self-energy contribution: 


Glip) = 9 (pL + GOD + (OLY +--+] (4.180) 


The series may be summed to give the Dyson equation for the Matsubara Green’s function: 


(ip) =—— 


aoe Dep (4.181) 


Changing ip > £ — u + ið gives the retarded Green’s function, which is the same as noted 
earlier in (4.171). This derivation proves that vz is really proportional to the spectral function 
of the G(s). The equivalence of this result to (4.172) is just an example of the general theorem 
proved in Problem 6 of Chapter 3: 
° de | 
G «(b = -i00 | 5, Awe rer (4.182) 


OO 

These results may be generalized to higher dimension and other energy bands. This 
extension is most easily accomplished using the Green’s function technique, since the steps 
are the same as in one dimension. The Matsubara form of the Green’s function of the b 
operators (4.172) is still given by Eq. (4.181) where the self-energy operator is summed over 
all states in the system: 


2 


X(ip) = 3 os (4.183) 


The real part of the denominator of the retarded Green’s function is (ip —> £) 
e — £, — Re[2,4(€)| (4.184) 


The Green’s function has poles at the points where the denominator vanishes. The poles of the 
Green’s function corresponds to excitations of the system. The poles are at the energy €, 
which satisfies 


č, = £, + RelZ,e(E,)] (4.185) 


The above expression for £, is the result asserted earlier in (4.133). If the pole lies within the 
continuum of states, then the resonance occurs, and there is no bound state. But if the pole 
occurs outside of the band of continuum states, then the system has a real bound state. A pole 
occurs when Im È = 0 at the same point that the real part of the denominator vanishes. 
Generally, Im È is not zero throughout the continuum band, so that Im È = 0 only outside of 
the band. 
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If it is assumed that the bound state occurs outside of the band, the spectral function has 
the form 


—2 Im X(e) 


A(€) = 2nd(e — £, — Re[X(e)]) + lee, — Re EF + [lm = 


(4.186) 


The first term comes from the pole of the Green’s function, while the second term is from 
those parts where Im 2 + 0. The first term may be rewritten as a simple delta function plus a 
renormalization factor. This procedure was described earlier in (3.155): 


-2 Im È 
A(g) = 2nZ8(e — &,) + — r 4.187 
(E) = 228 — £.) [e — £, — Re £f + [Im £} (4187) 
—1 
Z= (1 -2 =) (4.188) 


In other examples there may be more than one bound state, in which case the first term 
becomes a series of delta functions. 

These points are well illustrated by an example. Take a one-dimensional tight-binding 
model on a solid of unit separation a = 1 and length Z with a constant coupling constant: 


€, = —2w cos(k) (4.189) 
At = - (4.190) 


Also, take the initial bound state energy £, in the middle of the band €, = 0. The band 
structure is shown in Fig. 4.5. The Brillouin zone is defined by —n < k < n. The self-energy 
function &,.,(€) is now elementary to evaluate: 


C l C f" dk 
> = — —_—————— um E — aN ee 
ret(€) L 2 € +w cos(k)+id 2r l nE +w cos(k) + id 


The real part is 


C úi dk C sgn(e) .. > 
Re] = — | ——————— = — if 
el] =| see cos(k) /g2 — y2’ if e > w? 
= 0 if <w (4.191) 


FIGURE 4.5 Tight-binding energy bands in one dimension. 
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where sgn(s) is the sign of £. The imaginary part is 


—Im[X(e)] = < [ dfe +w cos(k)| (4.192) 


_ C _ C 
— wisin(k)| Vw- e 


The spectral function of the localized state is 


C C 
A(é) = O(c? — Dzn £ — - | + 2nd + —=—.| 
2CO(w? — £?) 


H Ve le? +C/w — 2) 


The step function ©(x) is unity if the argument is positive and zero if negative. The first term 
is nonzero only outside of the band of continuum states, while the last term is nonzero only 
inside the band. Figure 4.6 shows a plot of the entire spectral function. There are two sharp 
bound states, one below the band and one above. The continuous contribution throughout the 
band is due to the last term in (4.194). According to the sum rule, the total area under all the 
contributions must be 27. The solution to the equation 


if w > g? (4.193) 


(4.194) 


~ C 
is 
~ 1 
E, = +— [w + Vw +.4027]'” (4.196) 
J/2 
and the renormalization factor is 
2 — w? 
Z. = < . 
c= 3 (4.197) 


and is the same at both poles. For strong coupling, where C becomes very large, €, >> w. The 
renormalization factors approach }, so that one-half the spectral weight is in each pole. 


Ale) 


-Æ -10 -0.5 O 0.5 lO ç 


FIGURE 4.6 Spectral function from Eqn. (4.196). C = 1,@ = 1.0. 
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4.3. INDEPENDENT BOSON MODELS 


The independent boson model is very important in many-body physics. It is an exactly 
solvable model which describes some relaxation phenomena. It has become very useful for 
describing a wide variety of effects in solid state physics. This section discusses it at great 
length. Two different derivations of the basic mathematical result are provided. In addition, 
several variations on the model are described briefly. An exact solution may be obtained by a 
variety of techniques. The usual procedure is followed: first solve it by ordinary operator 
algebra, and later solve it using Green’s functions. 

The first Hamiltonian which will be solved is 


H = eela + P (a, + “| +> ogla (4.198) 
q q 


The Hamiltonian describes a fixed particle of energy £, interacting with a set of phonons with 
frequencies ©. The interaction occurs only when the state is occupied and cle = 1. The 
phonons are the independent bosons. An alert reader will note that this model was solved 
exactly in Chapter 1. By making a canonical transformation, the Hamiltonian may be 
rewritten as [see (1.239)] 


H =c'c(e,— A) + Yoyaja, (4.199) 
q 
where the self-energy 1s 
M2 
A=% — (4.200) 
q ®q 


The solution to this problem is identical with the problem of a charge on a harmonic spring in 
a uniform electric field. The electric field causes a displacement of the charge to a new 
equilibrium position, about which it vibrates with the same frequency as before; see (1.38). 

The present objective is to obtain a better description of the fluctuations about equili- 
brium. The self-energy A is for the zero temperature ground state configuration of the system. 
To study relaxation effects, the fluctuations need to be understood. They are obtained from 
evaluating the time variation of the Green’s function. 


G(t) = —i(T,c(t)c'(0)) (4.201) 


It is solved for the real-time Green’s function, which is permissible in the present case because 
the single impurity state c'c will not alter the phonon energies. In the thermodynamic 
averaging over the phonon states, the perturbation V has no effect, so there is no need to 
worry about the perturbation expansion for the factor exp(—B#). First solve the Green’s 
function of time at nonzero temperature. 


4.3.1. Solution by Canonical Transformation 


The Hamiltonian is first solved by a canonical transformation. This solution is nearly 
identical to the procedure described in Chapter 1, but here it is done with more rigor. A new 


Hamiltonian is desired by a transformation of the type 
H=&He*® = tele- A) +Y Ogala 
q 


4 (4.202) 
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The transformation must be done so that S? = —S. The transformation on any product of 
operators is done by taking the product of the transformed operators. The last theorem is 
shown by inserting 1 = e~*e°* between each operator: 


e ABCD.. -Ze = (e Ae (Be 5 )(e Ce) - - - (e°Ze~5) = ABC: -Z 


It is assumed that any function of operators may be expressed as a power series, and the 
transformation on a function of operators is just the function of the transformed operators: 


SFA = F a, d"e = Y a (AF = f(A) (4.203) 
n=0 n 


=0 


The transformation is considered on each operator separately, and the transformed Hamil- 
tonian is the old one with the new operators. These are evaluated using 


- l 
A = d Ae™ = A + [S, A] + 51S: [(S, A]] +- (4.204) 
M, 
S=cded — (al — ay) (4.205) 
q %q 
which gives 
C= cX 
cl = cxt 
M, 
Ag = Ag — —A cdte (4.206) 
Og 
M, 
al = al — Adc 
Oq 
where the operator is 
X= Maa 4.207 
= exp Ea T ( . ) 


Since X commutes with the c operator, the number operator is the same in the new repre- 
sentation, 


cte = ctcXtX =cle (4.208) 


since Xt = X—!. The transformed Hamiltonian is 
_ M M 
— eats + _ 4 ot aot 
H = EC c+ Fofa Oy. e) a ol -) 


M 
+) Ma a + ay — 2Tete) (4.209) 
q q 


= cle(e, — A) + X ayaia, (4.210) 
q 
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This transformed Hamiltonian is precisely the form which was the objective in (4.202). It is 
also the form which was derived earlier in Chapter 1. The reason for repeating the analysis is 
that the important factors X in (4.207) are needed for the discussion of fluctuations. 

This transformation is now applied to the Green’s function (4.201). The factor 1 = e~*e% 
is inserted into the trace, say for t > 0: 


Tr(e PF elt ce iH ch S05) (4.211) 
Using the cyclic properties of the trace, 
Tr(eSe7 BF ece ct eS) = Tr(e BA ece Hgt) (4.212) 


By using the previous theorems, everything in the trace is now changed to the transformed 
representation. The Green’s function may be written as (t > 0) 


G(t) = —ie®°Tr(e PH eP zeit) (4.213) 


It should be emphasized that this G(t) will be exactly equal to the earlier definition (4.201). 
The new equation for G(t) is just another way of evaluating the same thing. 

At first glance it appears that this evaluation is now trivial, since the Hamiltonian is 
diagonal in the c'c-operators. But this conclusion is untrue and misleading, since Z and ct do 
not commute with a or a! because of the X factor in (4.206). It is necessary to stick with the c 
and a representation and to put in X explicitly. The Green’s function becomes 


G(t) = —ie®°Tr(ePH e#*eXe iH ch yt) (4.214) 


However, it is now possible to achieve a great simplification. Since H is diagonal in c and a, it 
is easy to commute it through cX and to obtain the time development of the operators: 


elt Yet _ eit. o¥ (t) (4.215) 
Ma t iat —inqt 
X(t) = exp| — de ae £ — age") (4.216) 
a %q 


The phonon and electron parts of the trace may now be completely separated. For (t > 0): 


G(t) = ~ie”? Tr(e Pee" cl) Tre PX (1)X" (0)] (4.217) 
E, =£, — À (4.218) 
H, = 2 Oe (4.219) 


The result for G(t) has the great simplification mentioned above. The particle part is trivial 
(assuming they are fermions), 


ePOaTr(e Pee" ech) = 1 — nz (E,) (4.220) 


and there remains only the problem of evaluating the phonon part of the trace. This evaluation 
is nontrivial, although it may be done exactly. The operator disentangling method was 
introduced by Feynman (1951). 
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4.3.2. Feynman Disentangling of Operators 
The objective is now to evaluate the trace over the phonon distributions of the operator: 
F(t) = em Tre P29" X (XT (0)] (4.221) 


Each wave vector state q is averaged independently, and the final result is the product over q 
States: 


F(t) = F (À 
CO 
F , — ePQq > e7 Praa (nale PO On) 


Ng =0 


(4.222) 


— „ft pjoqt __ —ioqt 
By(t) = age Age 


For each state q, the trace is merely a summation over all possible integer values of n, 
between zero and infinity. The prefactor is the normalization: 


-1 
OO 
ePOq = (5 ote) = ] — eP% (4.223) 
na~ 
The present notation is simplified by first dropping all q subscripts 
oo iot —iot 
F(t) = (1 — 8) Y eP (pje Ma e-a) g(a" a) ny (4.224) 
n=0 


The state |n} is the one with n excitations and is given in terms of the operators as 


(a*y" 
Jnl 
The first step is to separate the operators in the exponential. Use Feynman’s theorem on the 
disentangling of operators, which is as follows: 


Theorem: If the operators A and B have the property that their commutator C = [A, B] 
commutes with both 4 and B, then 


AtB AB 1/2148] (4.226) 


In) = |0) (4.225) 


This theorem was proved in Problem 5 of Chapter 2. It is used to separate the exponents in 
X(t) and XT(0). To evaluate X(t), set 


A = hat ei (4.227) 
B = hae ™ (4.228) 
[4, B] = -X lat, a] = 2? (4.229) 
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and 
X(t) = AtB = eM eni e ghae (4.230) 
XO) = eet ea (4.231) 
X(HX'(0) = eM eT d prac“! onal o~ha (4.232) 


The result for X'(0) is just the Hermitian conjugate of X(t) evaluated at t = 0. The factor of 
X(t)X'(0) is what is needed in the average in (4.224). 

The next step is to get all the destruction operators on the right and the creation operators 
on the left. The center two operators need to be exchanged. Since they do not commute, this 
exchange will produce another complex phase factor. These two operators are written as 


i 
oh) eM — 


T je T™ eO ha") (4.233) 


The factor in brackets has exactly the form derived earlier in (4.203) where S = —Aa’. 
Consider 


2 


e™ ge” =a- Afat, a] + “lata, al]... (4.234) 
—=a+À (4.235) 

then 
ei phat) oda! — exp[he (a +A) = eh oO (4.236) 


so (4.224) is finally arranged into the desired form: 


ior) 


F(t) = (1 — e Beye 0e (4.237) 


co iat —iot 
x Soe BOM njet 2) emale) In) (4.238) 


n=0 


All the terms with a can be collected together in one exponential since all of these terms 
commute. Likewise for all the terms with at. Next prove that 


(1 —- e 6) D e7 Pro injet e~ “4 |n) — eWlWN (4.239) 
n=0 
1 


eo] 


(4.240) 


where, for our case, u = A(1 — e~), Equation (4.240) is proved by expanding the exponents 
in a power series: 


e™"jn) = D <q! In) (4.241) 
l=0 


Sec. 4.3 e Independent Boson Models 223 


Now recall the properties of boson destruction operators acting on a harmonic oscillator state: 


ajn) = n'/?\n — 1) (4.242) 
a|n) = [n(n — 1)]'2|n — 2) (4.243) 
1 11⁄2 
a!n) = | — In—1) (4.244) 
(n— 1)! ) 
The feature that a’|n) = 0 for / > n is very useful: It terminates the power series after n terms: 
E n (=u n 7? 
uaj yy — —] 4.2 
ein) =P ga) no (4.245) 


Of course, this truncation is why the destruction operators were arranged to the right. The 
other operator may be taken to operate to the left and just produces the Hermitian conjugate of 
the above result: 


* at n (u*)” n! 1/2 
(nje*? = (n — m| (4.246) 
map m! |(n—m)! 
These two results must be multiplied together. Using the basic orthogonality of the states 
(n — m\n — l) = 0,—m=n-l = Onl (4.247) 


produces the compact result 


u*at eoua n (—|u|?)' n! 
nle m = a (M? (n-D)! 


This power series should be familiar to every student of physics: It is just the Laguerre 
polynomial of order n (remember hydrogen wave functions): 


(4.248) 


(nje e™*in) = L (lul?) (4.249) 


The final step is to sum the series over n. This last series is just the generating function of 
Laguerre polynomials: 


A-2 F L (uD = e76- (4.250) 
n=0 


In our case, to prove the theorem (4.240), identify 


z=e Po (4.251) 


When these factors are collected, the result does prove (4.240). The expression for F (t) is 
F(t) =e (4.253) 

do) = X — e™) + NIL — 2/7] (4.254) 

PA = XIN + DA —e™) + NU — ee) (4.255) 
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Return to (4.222) and reintroduce the product over all q states. The function F(t) contains a 
summation of the exponential factor: 


F(t) = IF 6) = a| - > (0 = exp[—()] (4.256) 
q 

M 2 

(t) = sh) [NaC — eta) 4 (Na + 1d — e1a')] (4.257) 
q q 
N, = (4.258) 
a eba] 
The final result for the particle Green’s function for t > 0 is 

G(t) = —ie~#&e-Ve- PO (1 — np) (4.259) 


This equation is the exact result. Next proceed to a description of the physics. 


4.3.3. Einstein Model 


The physics is best understood by examining a simple application of the model. All the 
phonons are taken to have the same energy @ , which is called the Einstein model. The case of 
zero temperature will be discussed first. The modifications for nonzero temperature are 
derived afterwards. For zero temperature, all the phonon occupation factors are zero: 


Ng =Ny = 0 (4.260) 


Furthermore, the summation over wave vector just produces a coupling constant g: 
| y 
g=). — (4.261) 
q Yq 


The Green’s function still has the form (4.259), but now the factors are quite simple: 


A = g@ (4.262) 
M(t) = g(1 — e ") (4.263) 


The particle Green’s function will be evaluated for the case of a single particle, so set np = 0. 
Evaluate 


G(t) = —i@(t) exp[—ite, — g(1 — iœt — e™™®")] (4.264) 


The same result is obtained for the retarded function. The spectral function is then the 
imaginary part of the retarded Green’s function of frequency: 


Aœ) = —2 ml- | dte exp[—ite, — g(1 — im@pt — emt 
0 


= 2 Ret | dte exp[—ite, — g(1 — impt — ey (4.265) 
0 
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The time integral may be evaluated by expanding the ge~’o’ part of the exponent in a power 
series, 


l 
exp(ge ®t) = D (4.266) 
l . 


so that the time integral contains terms such as 
i 


—— 4.267 
W — E + Å — Ool + ið ( ) 


CO 
| exp[it(@ — £. + A — D] = 
0 
The factor ið is added to force the convergence of the oscillating integrand at large values of 
time. Then take the limit 6 — 0 and obtain 
i i 


= P Slo — A — Ql 
© — £, + A — Wo! + ið oe tA ol (© — €, + A — @pl) 


The spectral function is the real part of this time integral, which has just the delta function: 
A(@) = 2re E X 77 — £, + Å — 0l) (4.268) 
1=0 t: 


The spectral function is a series of delta functions, spaced exactly Wp apart. The distribution 
of peak heights follows a Poisson distribution. 

Remember that the spectral function is the probability that the particle has frequency œ. 
If there were no interactions, the particle would always have energy £, and there would be a 
single delta function at œ = ¢,. This limit is obtained from (4.268) by setting g = 0. For 
g Æ 0 the particle has a nonzero probability of occupying other states which have / phonons. 
These configurations are not excited states. Since the temperature has been set equal to zero, 
the solution must be describing the ground state. In the ground state of the coupled system of 
particle and phonon, some probability exists that the system will have the different sets of 
frequencies @, =&€,—A+/@p). The different values of œ; obviously correspond to the 
particle being coupled to some phonons, which is certainly to be expected in this system. In 
the ground state, the particle energy fluctuates among these different values @). 

The spectral function (4.268) is shown in Fig. 4.7 for two different values of the coupling 
constant g = 0.5 and 5.5. These values correspond to the weak and strong coupling cases, 
respectively. Weak coupling is g < 1. Here the l = 1 peak is smaller than / = 0, and higher / 
peaks get smaller very rapidly. For strong coupling, the peak strength increases with / up to 
values of approximately / ~ g, and then the peaks decrease again. 

It is useful to test the sum rules. The first one is [see (3.121)] 


© do g! 
= — —e § = 
1 | an A(@) =e 2 J | 


—0O 


0° l 


dwd(@ — @;) = e* a = | 
oe) I t 


The factor exp(—g) in (4.268) is now recognized as the normalization factor which maintains 
the sum rule. These spectral functions also have the property that 


Oo 
d 
e. = |. 5 0d(0) = (0) (4.269) 
This moment is easily proved by direct evaluation, 


œ d l 
(œ) =| SoA) =e DS (e, —A + ol) = E — A+ gap = E, 
I . 


—00 
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FIGURE 4.7 The spectral function of the independent boson model, Egn. (4.270), shown for an Einstein model and 
two values of coupling constant. (a) For g = 0.5, (b) for g = 5.5. 


which does give (4.269). The last integral is called the first moment. Higher moments (w”) 
may also be evaluated, and they will depend on coupling constant. By using the relation 


OO 


Galt) = -i0( | 2 A(ae™ (4.270) 


— OO 
the moments may also be shown to equal derivatives of the retarded Green’s function: 


n 


n —; d 
(© ) — P Galt) (4.271) 


t— 0t 


Since the retarded Green’s function is nonzero only at t > 0, one takes the time derivatives 
first and the limit of t —> 0* second. 

An inspection of Fig. 4.7(b) verifies that the first moment is independent of coupling 
strength g. The delta function peaks have an intensity envelope which does appear to have a 
maximum near @ % €,. As one increases the coupling constant g, the self-energy A = g@) 
becomes larger in magnitude, so that the lowest energy peak shifts downward in energy. But 
its intensity lowers also because of the factor exp(—g), and the envelope of the delta function 
peaks becomes a Gaussian. A Poisson distribution becomes a Gaussian for large g, which 
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may be shown in the following way. Large g requires large values of /, so that Stirling’s 
approximation may be used for the factorial: 

es /2nl Fe! (4.272) 


In this approximation the Poisson intensity becomes 


ERN l exp| -g +1 +1m($)| 


No 4/2 
-l awole ga! 
= exp| (g I+ tn( 1+ J ) 


ool g—-l\ Ife- N 
Jarl a| te 0 41(&4) -5 (EF) t 


2 
wl exp - -D | (4.273) 


2l 


which is a Gaussian. 

The zero-temperature result was described first, with an Einstein model, to emphasize 
that these additional peaks in the spectral function are really ground state properties of the 
system. For acoustical phonons, one gets a continuous distribution of possible phonon 
energies up to the bandwidth. The spectral function A(@) becomes a continuous function of œ. 
The spectral shapes are dependent on whether the coupling to the acoustical phonons is 
piezoelectric or deformation potential. 

Next consider the Einstein model at nonzero temperature. The phonon contribution ®(f) 
in the Green’s function is 


O(t) = o[(N + DA — ee") + NCI — er] (4.274) 


l 


= -aL (4.275) 


Because of the relationship 
N+1 N+1\'" 
= — ep2. om ) = eP®o/2 (4.276) 
the exponent M(t) may have its terms grouped in the following fashion: 


D(A = g|2Nv 41 — NN + 1(e7iot48/2) 4 eleo(eiB/2)| (4.277) 


The reason for this grouping will now become apparent. The objective is to expand 
exp[—®(t)] in a power series in exp(i@of), in order to evaluate the spectral function. Recall the 
series which generates the Bessel functions of complex argument, J, (z): 


es) E e (4.278) 


l=—oo 
The function exp[—®(t)| has this form using 


z = 2¢,/N(N + 1) (4.279) 


0 = w(t + iB /2) (4.280) 
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The retarded Green’s function at nonzero temperatures may be expanded as 


Galt) = —i@(t) exp[—it(e, — A) — ®(0)] (4.281) 
= —iO(t) exp[—it(e. — A) — g2N + 1)] 
x > I {2g /N(N + 1)je'® +2 (4.282) 
l=—oo 


and its spectral function is 


A(@) = 27 exp[—g(2N + D] à È hf2e /N(N +1 )}e ley B/2 
x 8 — E€ +A — os) (4.283) 


The spectral function contains a summation over the frequencies @, = £, — A+ Wp/ and is a 
delta function at these values. In this sense it is similar to the zero-temperature result (4.268). 
Now the coefficient of the delta function is far more complicated. Another important 
difference is that negative values of / are permitted. Although /_; = J, the factor exp[/Ba)/2)| 
skews the envelope of intensities to positive values of J. 


4.3.4. Optical Absorption and Emission 


The relaxation effects described above can, in some cases, actually be measured. Of 
course, one can never measure the properties of a one-particle Green’s function, as was stated 
in Sec. 3.7. Linear response theory always gives a two-particle Green’s function, which 
describes how the system responds when an external probe causes the system to change its 
state. However, in the many-boson model, there are some two-particle Green’s functions 
which have properties nearly identical to the one-particle properties which have just been 
derived. 

One important model is a localized defect with several possible localized electronic 
states. Each of these states may have a different matrix element for coupling to the phonon 
field: 


H=)> Wy A\a, +9 V; (4.284) 
q i 


V, = CĪC, E +> Mylaq + “| =h,C'C, (4.285) 
q 


This Hamiltonian may also be exactly diagonalized. A canonical transformation of the 
previous form is done below: 


H = He~ 


Mai 
l l q q 


H = 3 ogajaq + X (E; — AC] C; (4.286) 
M3, 

A; = >>—* (4.287) 
q % 
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Terms such as n;n; will be set equal to zero, since it is assumed there is only one electron on 
the impurity. It may be in different states but not in two different electronic states at once. The 
Hamiltonian (4.284) was written with the electronic states not interacting with each other, 
except through the phonons. Any terms which permit a direct interaction between the states 


usually render the Hamiltonian unsolvable, at least exactly. For example, terms such as 


[C1C, + CIC Mg (aq + 44) (4.288) 
q 


are not included. They permit the particle to change its state by emitting a phonon. Such terms 
probably exist in real systems, but their addition to the Hamiltonian makes the problem much 
more difficult. They are customarily omitted. 

In an optical absorption process, an electron may change its electronic state, say from i to 
j, by the absorption of a photon of frequency œ. This process is described by the Kubo 
formula, using the current—current correlation function. As in Sec. 4.3.3, the real-time 
correlation function may be employed. It is convenient to use the version of the Kubo formula 
given in Problem 16 of Chapter 3. For optical frequencies such that Bw > 1, the formula is 


CO 


Rey) =5- | e GO) (4.289) 


where (a, B) are (x,y,z) indices. The relation Bw > 1 is easily satisfied, since typically 
hw, © 2-3 eV is in the visible spectrum, while at room temperature B = 40 eV~!. For the 
transition between two localized states, the current operator is [see (1.194)] 


ja = Pi ali C; (4.290) 
1j 


The matrix element p;; is treated as a constant in this problem. It plays no role in the many- 
body physics which follows. The correlation function is: 


Ga lY O) = E Pi PuB (C OC(OCEC)) 
ijkl (4.291) 
U = (Cİ OCGACİC) = eP! Tre Pe Che cic) 
The Hamiltonian H is that in (4.284). This correlation function may be solved exactly by 
using the same steps that were used to solve the Green’s function (4.201) to get (4.259). First, 
the unit operator 1 = e~*e* is inserted into the trace: 
U =e Tre PH ec] Ce HC Ce Se) (4.292) 
Using the cyclic property of the trace, 
U =e Tree P eci Ceci Ces) (4.293) 


The canonical transformation is taken on each term inside of the trace, which gives 


U = e”? Tr(e PH ehtyi X CIC eH XiX CIC) (4.294) 
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The factors X; result from the transformation of the particle operators: 


C; = &CeS = Ces = CX, (4.295) 
M,; 
_ SE cat 
X; = exp - - o, — a) (4.296) 


The transformed Hamiltonian H in (4.286) is diagonal in the operators C and a,. The time 
development of the correlation function may be found at once: 


eyi y, Choe — x! (1)X,(1)C} Celi At) 
i A;+A A yt t tcc (4.297) 
U = ero SF) Tre PEA (OXOX (0)X/(0)C} CCC 


The electron and phonon parts of the trace may be separated, which is permissible because the 
X; operators do not depend on particle states: 


U = et YUU, y(t) (4.298) 
Ua = Trie ctc ctc (4.299) 
Upy(t) = Trle PX; OXOX OXO (4.300) 


The electron part is quickly evaluated. Using Wick’s theorem, the subscripts referring to 
particle states must be paired. In fact, one must have that j = k and i = l: 


U,, = Tr[e P” CCC] = n (l — n) (4.301) 
UÀ = Tr[e "Xi (OXOX X0] (4.302) 


This result is useful when evaluating the phonon part of the trace. It is similar in form to the 
earlier trace for the one-particle Green’s function. The only difference is that there are now 
four factors instead of two. The evaluation procedure is the same, since the four operators can 
be paired and combined into two. This simplification happens because operators which are at 
the same time can be combined, since their exponents commute: 


X} OXLA — iit (Tees a ) emit (4.303) 
— gilt (TI, eaaa) ov itt (4.304) 
Mu — M, , 
= exp È E (aĵe — age ow ] (4.305) 
q 


It is only possible to add two exponential operators in this fashion when they commute. They 
do in this case when taken at the same time. The correlation function U,„,(t) above is 
immediately simplified to 


U,,(t) = Ig Tr{eP” exp[ -Matea — ageiatyJeMea4a)} (4.306) 
Mı -M 
7 — —_ a ak (4.307) 
Oq 


This equation for U,,(t) is precisely the form (4.222) which was untangled previously, and 
now the effective coupling constant is A = [Max — Mg|/@q.- 
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The previous untangling result (4.257) can be used to obtain the evaluation of U,,,(2): 
U,,(t) = exp{—®,,(t)} 
(My — My) 
®,,(t) — La 
q 


q 


(Na + DC = e7] + Ng — e) 


These various results are collected, and the result for the correlation function for the 
conductivity is 


Re[0,,(@)] = =— m= n(1 — ny )Pid oP, 


©, @) 
x | dt exp{it[@ + £; — £; — A, + A, — ®,,(t)]} (4.308) 
— OO 
The function ®,, in the exponent is similar to the one for the particle Green’s function. The 
only difference is that the effective matrix element is the difference between the two single- 
state matrix elements: My q = Mg, — My. For example, if the two matrix elements M,, and 
Mą happen to be equal, then the effective matrix element vanishes. In this unlikely 
circumstance the spectral function is just a delta function at the frequency @ = €; — £}. 
Usually M,, and My are not equal, at least not for all different wave vectors, so that phonon 
effects are present in the transition. 

The above model describes dynamic relaxation. In the initial state of the system, the 
electron is in a state /, and the phonons are relaxed about their equilibrium configuration for 
the state /. Recall that if the phonon part of the Hamiltonian is written in terms of harmonic 
oscillator coordinates, the phonons relax to an equilibrium configuration given by [see 


(1.245)] 
P l 2 
O= _2 l 4.3 
q (52. ( 09) 


In the optical absorption, the electron starts in state i and ends in state k. The phonons start 
with an equilibrium configuration about the pom Sa oY but end the optical transition with the 
equilibrium configuration about the point Of The phonon system must alter its equilibrium 
configuration during the transition. This change is a relaxation process, since it must relax to 
the new equilibrium configuration during the optical step. The physics problem is to deter- 
mine how the phonon relaxation process affects the absorption spectra. This information is 
contained in the result (4.308). 

The process is indicated schematically in Fig. 4.8, which shows a potential energy 
diagram for each oscillator coordinate Q,. There are two parabolic curves, with parabolicity 
o . The lower curve describes the ground state of the system. The electronic energy is g;. If 
there were no coupling to the particle in state /, the phonon parabola would be a minimum at 
this point Q} = 0. However, because of the coupling M,, to this particle, the potential 
minimum is at oP. The upper curve describes the final state potential energy curves of the 
phonons plus particle. The particle energy is ¢,, and the curve crosses here because the 
phonon system has this energy when Q, = 0. This potential energy curve has the minimum at 
ow. ). This minimum has been drawn on the other side of the axis to emphasize that it is 
usually a point different than the ground state minimum. Figure 4.8 is called a configurational 
coordinate diagram (Williams and Hebb, 1951). Such diagrams were originally constructed 
with a single-particle coordinate Q, which represented the real atom displacement of the first 
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FIGURE 4.8 Configurational coordinate drawing of the independent boson model. The two parabolas represent the 
phonon potential energy of the initial and final electronic states in the transition. Transition path A is most likely, 
while path B is less likely but gives the zero-phonon probability. 


shell of atoms about the impurity. However, the use of configurational coordinate diagrams is 
much more rigorous when interpreted as the potential energy of the individual phonon modes. 
The physical question is whether the optical transition happens vertically on this diagram 
or along some other trajectory. For example, the arrow B is from one potential minimum to 
the other. This transition is called the zero-phonon line. In reality, all these transitions are 
possible, and each has a probability of occurrence. These probabilities are given by (4.308). 
The physics is well illustrated by again using the Einstein model for phonons. This formula is 
expanded in the same manner used to derive the one-particle Green’s function (4.283): 


T _ 0° , 
Re Sup) ~ O 2 nQ 7 Nx )P kd oP k1,Be u(t) >, (0) ~~ Om in Yee Booo/2 
m=—0o 


— 2 
gy(T) = (2N + 1)gu = (2N + 1) yu Mu 
a q 


(4.310) 


O,, = & — E; —A, + A; + mp (4.311) 


Yu = 28u vV NN + 1) (4.312) 


This formula for Re[o(@)]| has exactly the same form as the one-particle spectral function 
(4.283), as an expansion in Bessel functions of complex argument. Now the coupling constant 
to the phonons is determined by My, — My. If My, = My, then g = 0, y = 0, and the 
conductivity is a single delta function at m = £; — £. 
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The zero-phonon line is given by the term with m=0. It has the energy 
On—0 = E — E; — Ay + A), which corresponds to the B arrow in Fig. 4.8. The probability of 
this transition is proportional to 


euD] (Yy) (4.313) 


The envelope of delta function heights is determined by the m dependence of the factor 
L(Y) eXp(mB@)/2). For coupling constants gy greater than unity, the delta function inten- 
sifies with m and then falls off at higher values. The strongest delta function occurs at positive 
nonzero values of m, which is illustrated in Fig. 4.9. The circles show the peak heights 
calculated for gyu = 4 and N = L. The points are the emission spectra, which will be derived 
below. 

The usual interpretation is that terms with m > 0 correspond to the creation of m 
phonons during the optical transition. The zero-phonon line is just the transition where no 
phonons are created or destroyed. Transitions in which m > 0 are phonon emission, and 
transitions in which m < 0 are phonon absorption. The phonon absorption process is possible 
at nonzero temperature, when phonons are thermally present in the initial system. This 
interpretation ignores the fact that both the ground and excited states are an admixture of 
phonon states. The spectral functions (4.283) of both initial and final states contain admix- 
tures of different numbers of phonons. These differing admixtures are due to fluctuations in 
the system. Sometimes the particle has one phonon around it, while other times it has three or 
four, etc. The probability of different fluctuations is given by the Bessel function coefficients 
in (4.283). 

The other interpretation is that the absorption spectra of Fig. 4.9 correspond to transi- 
tions between states of different fluctuations. The initial state may be in a state with a fluc- 
tuation of plus four phonons and the final state with plus six. It takes a net of plus two phonon 
energies to complete the transition with energy conservation. This model is not really 
different from the above interpretation. The phonons in the fluctuation still have to dissipate 
away. However, one state may have a different number of average phonons, in its fluctuating 
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FIGURE 4.9 The emission and absorption spectra for an Einstein model, in the independent boson approximation, 
from Eq. (4.239). The o’s and x’s mark the peak intensities of each phonon sideband. 
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cloud, than the other. The number of phonons which must be created or destroyed to make up 
this average is not really creating a net amount to dissipate into the system. They will, instead, 
stay at the impurity site and take part in the fluctuations. 

The emission spectra will now be considered. It is assumed that the particle has been 
elevated to an excited state at an earlier time, perhaps by an optical absorption process. 
However, experimentally it is often done by electron bombardment. The phonons in the 
excited state relax around the particle and eventually attain the equilibrium configuration plus 
fluctuations. This assumption is valid only for long-lived excited states. Then a photon is 
emitted while the particle drops to a lower energy state. Of course, the phonon system must 
adjust during the emission process just as it did during the absorption. The same relaxation 
processes are encountered again. 

The emission spectra is derived without doing any more work. It may be obtained from 
the absorption result by a simple argument. The emission is the rate per unit time that photons 
are produced in the solid. Of course, the absorption is the rate at which they disappear. Write a 
simple rate equation for the average number of photons N of frequency œ, 


dN 
— = —wN 4.314 
da. (4.314) 
which has the solution 
N(t) = Ne” (4.315) 


where w is the rate of absorption. Since the conductivity o(œ) also has the units of s7}, one 
might expect that w œx o. In fact, the exact relationship is 


4 
wo (4.316) 
Ei 
where g; is the square of the refractive index. The rate of photon absorption is just propor- 
tional to the real part of the conductivity, as given by the Kubo formula. Another way to 
understand this is to consider rewriting the Kubo formula with some additional factors: 


o(@) > (1 — e=) | dte'™ ( j(t)b'(t) j(0)b(0)) (4.317) 
—00 
The two additional factors are the creation and destruction operators for photons b and bt. 
The right-hand part of the correlation factor, 


j,(0)b(0) = P PuaC}Cib (4.318) 


describes the process whereby a photon is destroyed (by being absorbed) and an electron has 
its state changed from / to k. This term arises from the p: A interaction. The left-hand part of 
the correlation function contains the factors 


MOMOE > puC ACODA (4.319) 


which describe the inverse process at another time t. The correlation function between these 
two events is the Kubo formula. The photon parts may be factored out of the expression using 
bi (t) = btd and (btb) =N 


palo) NC = eP) dee (0) (0) (4.320) 
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This quantity is, except for the factor of 47, just the right-hand side of Eq. (4.314). It is the 
rate that photons are destroyed in the sample by optical absorption. 

For emission, calculate the rate at which photons are created. The first step in the 
correlation function would describe a photon being created while the electron changes its 
state. This correlation function should be 


| Al j(B(0)j(0)5"0) (4.321) 


where the prefactor (1 — e~®) is dropped since it is near unity. The first operator on the right 
b? creates a photon as part of the emission process. Take these photon factors out of the 
integrand, using b(t) = be~*™ and (bbt) = N + 1, to obtain 


(ob) | dre OO = ON + nao (4.322) 


This result, multiplied by 47, must be the rate that photons are being made in the solid: 


dN 
P7 =I(N +1) (4.323) 
I(@) = an| dte ™®™]{ Ip) Ju(0)) (4.324) 


The formula for /(@) is identical with the Kubo formula except that the frequency has 
changed sign. Compare this formula with the Einstein model result (4.310), for example, and 
only change the sign of to get 


Io) x eD O Slot Opp (yete (4.325) 


M==— OO 


The outcome is very simple. The emission spectra is just the mirror image of the absorption 
spectra reflected across the zero-phonon line. The emission is illustrated by the points marked 
X in Fig. 4.9. The emission spectra are mostly on the low-frequency side of the zero-phonon 
line. In general, the emission spectra have a lower average frequency than the corresponding 
absorption spectra for the same processes. The difference is a consequence of relaxation, 
which is illustrated using the configurational coordinate diagram of Fig. 4.10. The vertical 
arrow “abs” shows the most likely absorption event. Of course, other absorption events are 
possible, but the peak of the envelope of delta functions is at this vertical transition. 

Similarly, the most likely emission event is the downward arrow marked “emiss.” It is 
shorter than the absorption arrow, which indicates that the average emission takes less energy. 
No matter how one draws those two parabolas with respect to each other, the downward 
emission arrow is always shorter than (or equal to) the upward arrow. This fact can also be 
demonstrated by using the various theoretical formulas which have been derived. However, 
the simple diagram of Fig. 4.10 is probably the clearest proof that the emission has an average 
lower energy than absorption. 

The lower average energy in emission, compared to absorption, is understood using the 
first interpretation mentioned after (4.310). During absorption, one usually makes phonons. 
The photon energy is given by the change in particle energy plus the phonon energy 


œ = A(particle energy) + phonons (4.326) 
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FIGURE 4.10 Configurational coordinate drawing shows that the average absorption frequency is higher than the 
average emission frequency. 


This energy is higher than the particle transition energy. In the emission event, the energy 
comes from the particle transition, but here too one makes phonons. Energy conservation is 


A(particle energy) = œ + phonons (4.327) 


and the average photon energy œ is less than the particle transition energy. 

Figure 4.11 shows an experimental result for absorption and emission at an impurity 
center in ZnTe, reported by Dietz et al. (1962). The emission and absorption are mirror 
images about the zero-phonon lines. The arrows at the top show the separation of the optical 
phonon lines, which are very clear in the emission spectra. The two other peaks between 
successive LO phonons are sidebands due to LA and TA phonons. These data were taken at 
20 K. It is an unusually good example of the relation between emission and absorption. 


4.3.5. Sudden Switching 


There is another way to derive the result for optical emission and absorption. Of course it 
obtains the same answer, since both derivations are exact. This other derivation emphasizes 
the switching aspects of the problem. It is just an alternate method of obtaining the same 
answer but perhaps provides a slightly different physical insight. The many-body problem is 
presented as the system response when a potential is suddenly switched on. Return to the 
Kubo formula and consider the time correlation function: 


U(t) = Tr(e PH ect Ce CIC) (4.328) 


In writing this expression, the subscripts of the particle operators are paired up. Now the 
Hamiltonian H occurs at three places in this expression. It has two different forms: one for 
each different place it occurs in the sequence of operators. The factor on the right is 


eHtCTC, (4.329) 
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The operators ci C; act to the right. They destroy the particle in state / and create one in state 
k. The operator H acts on a system with a particle in k but not in /. According to the form 
(4.284) for the Hamiltonian, 


H =H, +} h;CİC, (4.330) 
H, = 2 Ogala (4.331) 


When H operates on a state |k) = CŻ10) then H|k) = (H, + h,)|k). In this case, e~h ctc, 
may be replaced in the operator sequence by 


ecic, = e ME tWCTC, (4.332) 


This result is exact, and no approximation is involved in the replacement. The other two 
factors of H are at the left of the operator sequence. On their right is the immediate sequence 
of particle operators, 


e BH eit CIC, (4.333) 


which returns the particle back to the state /. These two Hamiltonians operate on the system 
with a particle in the state / and therefore produce 


e BH, +h) eit, Hh CTC, (4.334) 
The correlation function U(t) in (4.328) is exactly equal to 
U(t) = Tr(e PEt) git +h) CTC, eH tho) CTC) (4.335) 


The particle operators may now be removed from the trace. The time development of the 
correlation function involves only the operators H, and h,, and these contain only phonon 
operators. The particle operators have done their job, in the correlation function, by deter- 
mining which effective Hamiltonians operate at various points in the sequence. There remains 
only to evaluate the phonon part of the trace: 


U(t) = el n(1 — ny)U, p(t) (4.336) 
U(t) = Tr o BUH, +h) pit, +h) eH, +h) (4.337) 
hy = Mg (aq + 21) (4.338) 

q 


This correlation function could be evaluated by Feynman disentangling. However, another 
method will now be developed. 

When a Hamiltonian is solved in the interaction representation one writes H = Hy + V, 
where Hy is a part one can solve and V is the perturbation. Here the procedure is slightly 
different. The Hamiltonian H, is for the state with a particle in /, 


Ay = H, +h, (4.339) 

and H, + V is the Hamiltonian for the particle in the state k, 
Ho + V =H, +h, (4.340) 
V = h; — hı = £ — £ + XMa — My)(aq + 44) (4.341) 
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The perturbation V is the difference between these two Hamiltonians. The phonon part of the 
correlation function may be written as 


Us, (t) = Tre PH elo e-+) (4.342) 


The next step is to make Ho a diagonal Hamiltonian, which is not difficult, since the unitary 
transformation is given earlier in (4.286): 


Hy = e' Hye! = £; — A,;+ >> Wg Aq 
q 
Hy +V = (H + V)e™ =e, -A+ Y Ogalaa 
q 


M., — M,,) 
+ y ae M + > (Mak 7 Maag + al) 
q 


; - (4.343) 


My 
S] = 2o (ay — aq) 
q q 


(Mar — Mai) 
Ay = } 


q Oq 


The transformed potential V is obtained by subtracting Họ from Hy + V: 
V =e,—¢,—-A, +A, +Ay+5V (4.344) 
ôV = X (Myx — Mgy)(Gq + 24) (4.345) 
q 


The unitary transformation in (4.343) is applied to all the operators in the correlation function 
(4.342). It is done following the same steps used to transform (4.291), except that here only 
exp(S,) is used, 


U, (Ò) = Tr(e PH e"o eit(Ho+1)) (4.346) 


where Hy now has the diagonal form in (4.343). 

This correlation function has the form of the switching phenomena, as mentioned at the 
beginning of this subsection. It describes the response of the system to suddenly switching on 
the potential V at time t = 0. This switching on causes numerous transients in the phonon 
system, and these transients are the cause of the phonon sidebands observed in absorption. 

To illustrate this switching, consider a system which has a Hamiltonian given by 


H=H,+Vad'd (4.347) 


where d? and d are creation and destruction operators for some particle. The Green’s function 
of the d particle is given for (t > 0) 

G,(t) = —i(O|e"“de~ a" 0) (4.348) 
The particle d is created at time t = 0. Then et operates on the state with a particle in d and 
Hd" \0) = (Hy + V)d"|0). At a later time ¢ the particle is destroyed by the destruction operator 
d, and e” operates on a state with no d-particle. The Green’s function for t > 0 is 


G,(t) = —i(dd*y (e t e7" + (4.349) 
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which is exactly the form of the correlation function (4.346). All the d particle did for us was 
to switch on the potential V at t = 0 and switch it off at time t. This switching-on causes the 
phonons to respond and adjust to the new potential V. These transients are the same 
relaxation processes which were discussed earlier. The correlation function (4.346) is eval- 


uated by first recalling the interaction representation result (2.1.7): 


— to 

eote" +V) — r exp| -i| dt, rw) (4.350) 
0 
In this case, the operator V(t) is 
V(t) = eH Veio (4.351) 
= &, — &] — A, + A; + Ay + 5V(t) (4.352) 
SVA = (My — My age" + aţi) (4.353) 
q 


The constant terms may be immediately removed from the correlation function. Only the 
operator part ÒV (t) needs to be time-ordered: 


- - - p 
eote "t — exp[—it(e, — £; — Ay + A; + Ay)]T exp -f an5) 
0 


Now SV (t,) contains only two terms, which are, respectively, proportional to a‘ and a. Since 
the commutator of these results is a constant, one immediately considers evaluating these 
time-ordered exponents by using Feynman’s theorem (4.226). It is not correct to use precisely 
the form of (4.226). The two terms in the exponent operate at different times, and this fact 
must be considered. The proper procedure is done in Problem 5 of Chapter 2, which is to 
separate the two terms in the time-ordered exponent. Again it is done for a single value of 
wave vector q, and this subscript is suppressed. The equations are: 


Á t 
T exp j- l dt, arn) = PAKO) exp |- l dt, ee Oaeei | 
0 0 


i , À , 4.354 
o(t) — -a | dt," — —(1 — e™) ( ) 
0 Q) 
A = (M; — M;) 
The time dependence of the exponent in the integrand is 
eT get) — a+ H(t) (4.355) 
This time integral may now be evaluated and gives 
d . A , 
— a| dt,e"™ |a +—(1- | (4.356) 
0 A) 
A N a 
— —q = (1 — e” t — — — (1 —e™ 4.357 
a—(1— 6) + i —— (l e) (4.357) 


The phonon correlation function is now 


2 
U p(t) — ete, —€;—A, +A) exp bet _ om) Tr(e BH e” tO gab") 
q ~q 
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Note that the factor Ay has canceled out of this expression. The trace is evaluated using the 
previous result (4.240). The normalization factor exp[BQ] has not been written explicitly in 
these equations but obviously should be included. The final result is 


U,,(t) = exp[—it(e, — €; — A, + A;) — O,(0)] 


— 2 4.358 

vul) = ZEA O, + 1 — eH) + NGI e] mo 
q q 

which is precisely the earlier result in (4.308). 

The following physical picture emerges from this derivation. The coupled system of 
particle and phonon has been sitting in a state, say /, where the phonons have been fluctuating 
around the equilibrium position. Suddenly the particle is moved to state k by the optical 
absorption process. From the point of view of the phonon system, it appears that the potential 


ÒV = (Mg, — My)lag + 24) (4.359) 
q 


has been suddenly switched on at that time which is taken to be t = 0. The transient response 
in the phonon system is measured by switching off the potential at a later time and measuring 
the correlations which result. The resulting function of time describes the temporal evolution 
of the phonon system. Its Fourier transform gives the phonon sideband structure which is 
observed in the optical spectra. This picture explains the mirror relation between emission and 
absorption. In absorption the potential which is switched on is (4.359), while in emission, if 
the particle transition is in the opposite direction, the potential which is suddently switched on 
is just the negative of (4.359). 


4.3.6. Linked Cluster Expansion 


Another way to evaluate the time-ordered exponential operator 


U(t) =T (p=: | dt, r) (4.360) 
0 


is by the linked cluster expansion, which was discussed in Sec. 3.6. It is sufficient to derive 
the result for a single-phonon state of wave vector q. The summation over the q vector in the 
exponent is easy to do at the end. Consider the case where V(t) is written as 


V(t) = A(t) (4.361) 
A(t) = ae~™ + altel! (4.362) 


The S matrix is expanded directly, and each term is evaluated. The series is presumed to be an 
exponential series. In the present example, there is only one term which remains after the 
resummation. The nth-order term is 


U(t) = (i U,(t) (4.363) 


u(t) = dt; | dt,---| dt,(TA(t,)---A(t,)) (4.364) 
n! Jo 0 0 
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Since A(t) describes the creation or destruction of a phonon, the A operators always exist in 
pairs. Only terms with n even are nonzero; the others are zero. Since n is even, define n = 2m: 


UO = Yo (i) Vay (4.365) 
m=0 
42m t t t 
Urnll) = Tom | dts | da | tag TACH) Allon) (4.366) 


According to Wick’s theorem, the A operators pair up to form phonon Green’s function 
(TA(t, )A(t,)) = iD(t, — b) = (N + De —2! + Nell! 
1 (4.367) 
N= ebo — | 


Each phonon Green’s function depends on the difference of two time variables D(t; — t;). The 
time integrals over those two variables define a function 


o(t) = if dt, | dt, D(t — ty) (4.368) 
0 0 
= Z (N +D -e™)+N(1-e™)) (4.369) 


The mth term has m phonons: 


(TAC) n - A(lm)) = » D(t E t)DG ~ t;) n -D(t, _ tom) 


all combinations 


The quantity U,,,(t) is just proportional to ọ(t)”, since each combination of time-ordered 
products gives this same result. The only remaining question is the combinatorial determi- 
nation of the number of different arrangements. The number of such combinations is 


(2m)! 
2mm! 


(4.370) 


which is obtained in the following way. The first variable, t;, may be paired with any of the 
2m — 1 other variables. The next time variable, which is ¢, if it was not paired with ¢,, is 
paired with any of the other 2m — 3 variables. The number of combinations is 


2m)! 
(2m — 1)(2m — 3)---3°1 _ | m) (4.371) 
2mm! 
The result is 
42m f m 
Umt) = — D) (4.372) 
m! | 2 
which is summed to obtain the correlation function: 
2 
U(t) = exp - p %0] (4.373) 


The function 4/2 is now recognized as the exponential function of time which has already 
been derived several different ways. It is just the double integral over the phonon Green’s 
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function. The linked cluster expansion is obviously simple, since there is only one distinct 
linked cluster, which is (7). 

Another way to evaluate the Green’s function is to leave the particle operators in the S- 
matrix expansion. There are occasions when it is not immediately apparent that they can be 
factored out, as was done in (4.328), to give the starting point for (4.346). There are several 
exactly solvable models which can be evaluated this way. It is useful to learn the technique. 
Consider the Green’s function for a single particle in a band, which is coupled to phonons. 
The subscript q is suppressed again in the notation: 


H=H+V, Hy=aCiC+oa'a (4.374) 

V = ACİC(a + at) = ACÎCA, A=a+at (4.375) 
G(t) = —i(TC(t)C'(0)) = —i0(t)(e#*' Cte- C) (4.376) 
GO) = iON e E (4.377) 


Since there is only one particle in the band, the creation operator must be to the right, so 
t > 0. The unperturbed Green’s function G® (f) has a simple form. The interacting Green’s 
function is written, following Chapter 2, as 


GO = —i(TC()U(1)C'(0)) (4.378) 


The U(t) matrix, or S matrix, is now expanded in an infinite series of terms. This infinite 
series will generate the linked cluster expansion. However, now the particle operators Ct and 
C are included, along with the phonon operators a and at. Again only terms with n even are 
nonzero, and the 2mth term is 


_ 2m et 1 a A a 
Gy,(t) = -00 T | dt, | dtn (TADA) --- AC tom) 
x (Tİ) -< Ch (tn) C(t) CT (0)) (4.379) 


Now there is a time-ordered product of particle operators. At first the time-ordering appears to 
complicate the evaluation procedure. After all, this time-ordered product of operators, 


W(t, tiso, bom) = (TOC! (H) «Chan Cam) CT (0) (4.380) 


must be expanded, according to Wick’s theorem, into all possible pairings. However, a careful 
inspection shows that the time-ordered product has a trivial evaluation. The number operator 
C'C is time independent in the interaction representation since Hy commutes with the number 
operator, 


CINCU) = ect Ce = CIC =n (4.381) 
so that the time-ordered correlation function just contains a product of number operators: 
W(t, ty, .. , tom) = (Tnt (0) = (TC(HCT(0)) (4.382) 


The number operator gives unity when operating upon the state with one particle. The W 
function does not actually depend on any time variable except t. The term remaining, on the 
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right-hand side, is just proportional to G(¢). Since this same factor occurs in each term in 
the S-matrix expansion it may be factored out of the series: 


Gt) = E Galt) 
m= (4.383) 


_ 2m pt t a a a 

Gayl) = -i000 TO | ty | dran (TAEDA Atay) 
(2m)! Jo 0 

What remains in the series is just the same linked cluster expansion which was evaluated 

above. The exact Green’s function is, again, our result which is now very familiar: 


2 
G(t) = GOA) exp -20 (4.384) 


The presence of the C operators in the linked cluster expansion did not, in this case, change 
the result. Usually that does not happen. For most Hamiltonians, the presence of the C 
operators changes the evaluation of each term in the S-matrix expansion. These cases are 
often not exactly solvable and certainly are not by the method under discussion. 

The usual polaron problem, for one free particle in a band, is described by the Hamil- 
tonian derived in Sec. 1.3: 


H=% ex CiCy + Llogatag + Mp4 P(A) (4.385) 

q 
A= aata a (4.386) 
pq) = > Chg (4.387) 


The coupling between the particle and phonons depends on the particle density operator, 
pla, D = E Ch yq Cpe rte (4.388) 
k 


which is time dependent. If one were to attempt to solve for the particle Green’s function by a 
linked cluster expansion, the particle part of the correlation function becomes far more 
complicated. This Green’s function is 


Gik, t) = iO (A) (e Ce Ct = iO e Ce Ht Ch) 


V = E MAPO. (4.389) 
q 


When evaluated by expanding the S matrix, it produces the 2mth term of the same general 
form as (4.379): 


l (—i)*” t t 
Go,(k, t) = —10(1) (2m)! | dt; +: | dtom 2o 2 Mg, Mg, Mg, 
° q; Qam 


x (TAa (ty) +++ Ag, (tom) TOPA, t1) PAm tam)CT(0))—«(4.390) 


Now the particle part of the correlation function depends on products of the density operators 
at different times. Usually this function does not have a simple time dependence, and the 
polaron problem is not exactly solvable. One may still evaluate it by a linked cluster 
expansion. But this process now becomes an approximate procedure, whereby one evaluates 
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only a few terms in an infinite series, and the remaining terms are omitted. This approx- 
imation will be discussed in Chapter 7. 

Van Haeringen (1965) proved the theorem which specifies the most general possible 
conditions for which (4.389) may be solved exactly by the linked cluster method. An exact 
solution is obtained whenever p(q, £) has the form 


p(q, £) = e pq, 0) (4.391) 
(4, 0) = PQ) = Lrg (4.392) 


Since the general time dependence is given by the kinetic energy difference f(k, q) = 
Ex+q — €x, his condition is that this difference is independent of k: 


f(Q) = Ek+q — Ek (4.393) 


If these conditions are met, the exact solution to the Green’s function may still be obtained 
from the linked cluster method. Since the factor exp|it/(q)] is a c number, and not an operator, 
it may be removed from the time-ordered product of particle operators: 


A A 2m A A 
(TC(t)p(qy, t1) Pam» tzm) C? (0)) = exp f 2 tif @) (TCP) «+ + P(Gam)C*(0)) 
J= 
(4.394) 


This step leaves, in the time-ordered product, just the time-independent density operators 
p(q). These operators have the same effect that the number operators did previously. They do 
not change the value of the correlation function. It is exactly equal to 


2m 2 


exp : tf J (TC(A)CT(0)) = iG (k, £) exp i D tf J (4.395) 
j=l j=l 


The validity of (4.395) may be shown in the following way. The effect of p(q) on Ct 0) is just 
to change the particle from the state k to k + q: 


p(a)Cp10) = $ CÈ gC C10) = Ch, 410) (4.396) 
k’ 


The effect of a product of such density operators is to change C}|0) by a summation of all 
their q vectors: 


p(q:)P(G2) -© P(Gam C10) = Ch IO) (4.397) 
Q=k+} q =k (4.398) 
J 


(TPCA, ty) -+ P(Gam> tom) C(O) = (TOC (0)) = iGO, t) 


The result requires that the summation over all of the phonon wave vectors is zero. This 
cancellation is required by the phonon part of the correlation function. When this correlation 
function is evaluated, by pairing according to Wick’s theorem, the pairing forces the q values 
to be equal and opposite in sign: 


(TAg (i)Aa (6) = 8q=-4, (TAq,(4i)A-g,G)) = iðq, =- P(0;, fi — t;) 
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The summation of all the q values is certainly zero, since they are paired in sets of q; and —q,. 
The assertion (4.395) is proven. 


The extra time factors exp(itf) go into the evaluation of the linked cluster expansion. 
Define a new function: 


1 , Á , 

b(A =- M2 | dt, | dt,iD(q, t, — tjet OHA- (4.399) 

J 2 q 0 i o 2 
q 


It is evaluated for the case f(—q) = —f (q), which is the interesting physical case discussed 
below: 


Og 


M2) — vy, p ieee | oy fice 4.400 
t) = n aooo , 
vO = Naga pa | ar E 


Terms with @, — f (q) are identical to those with @, +f (q) by the variable change: q > —q. 
The exact result for the Green’s function (4.389) is 


G(k, t) = -iO (He E eT &O (4.401) 


Equation (4.401) is the most general form for G(k, t), which may be exactly solvable, 
according to van Haeringen. 

What kind of systems will obey his condition that €,,4 — & =/(q)? One possibility is 
that the energy is a constant, €, = & so that f = 0. It is this example which has been 
repeatedly worked throughout this section. However, there is another case which is now 
recognized as an exactly solvable model. It has the particle moving with a constant velocity v, 
so that its energy is 


& = vk (4.402) 
f(q) =v: q = —f(-q) (4.403) 


It should be remarked that the system of equations is not Galilean invariant. Even though the 
particle is moving with constant velocity, one cannot transform the equations into a system 
where it is standing still, in a stationary system of phonons. The phonons, particularly the 
acoustical phonons, have their own velocity and are not invariant under a Galilean trans- 
formation. Instead, the phonon frequencies appear Doppler shifted to the particle. The particle 
Green’s function could be evaluated for the case that it was going faster or slower than the 
acoustical phonons system. Presumably the phonon relaxation around the particle will vary 
considerably between these two cases. This model is illustrated in the assigned problems. The 
one-dimensional constant velocity model for polarons was solved by Engelsberg and Varga 
(1964). 

In the polaron problem the particle-phonon coupling produces fluctuations in the 
number of phonons surrounding the particle. This number fluctuates from time to time and 
from particle to particle. In the independent boson model, each phonon, in this fluctuating 
cloud around the particle, is assumed to exist with a probability which is independent of 
whether other phonons are also simultaneously present. Each phonon is fluctuating inde- 
pendently of the others. If a particle is fixed, after emitting a phonon it is still fixed. The 
probability of emitting the second phonon is the same as the probability of emitting the first. 
When the probability of each emission or absorption is the same, then one has the inde- 
pendent boson model. Of course, a particle moving with a constant velocity v also has its 
motion unaffected by how many phonons have been emitted. Again the probability of 
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emitting each phonon is independent of others which may have been emitted, so that all 
phonon emissions are independent. 

A free particle with kinetic energy £, = k*/2m is not described by the independent 
boson model. After emission of one phonon with wave vector q, the particle goes to an 
intermediate state with wave vector k + q and energy (k + q)”/2m. The intermediate state is 
different from the starting one, so that the emission of the second phonon has a probability 
that is different from the emission of the first. Each emission has a different probability, so 
that the probability of n phonon emissions is not just a Poisson distribution, 


P, =e% (4.404) 


where « is the probability of a single emission. The Poisson distribution is the characteristic 
zero-temperature distribution of phonon sidebands only when each emission has the same 
probability, which is independent of the number of other phonons emitted. 

The independent boson model has been used widely, with a number of variations. 
Almbladh and Minnhagen (1978) solved the Fano—Anderson model with phonon coupling to 
the localized level. A number of related models have been reviewed by Cini and D’Andrea 
(1988). 


4.4. BETHE LATTICE 


The terms Bethe lattice and Cayley tree both refer to a type of lattice in which there are 
no paths which are loops. Each site has z nearest neighbors, where z is an integer greater than 
one. Since z = 2 is a one-dimensional chain, one usually takes z > 2. Figure 4.12(a) shows 
the Bethe lattice for z = 3, where a site is at each vertex. Starting from the central point, there 
are three neighbors. Each of those neighbors also has three neighbors. Of course, each point 
has the same symmetry. 


4.4.1. Electron Green's Function 


The Green’s function for the Bethe lattice will be calculated for the Hamiltonian with the 
nearest neighbor hopping term 


H=-y CCa -uI CC, (4.405) 
J, j 


The summation over 6 runs over the z neighbors. Spin plays no role in this calculation and 
will be omitted. The last term contains the chemical potential u. So how does one calculate 
the band energy of a lattice without a regular crystal structure? The solution was given by 
Brinkman and Rice (1970). 

As usual, the Hamiltonian is written as H = Họ + V. In this case set Hy) = —uN. The 
hopping term is put into V. Define the Green’s function %(l, ip,,) as that needed to advance / 
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(b) 


j= 


j= 


j=0 


FIGURE 4.12 (a) Bethe lattice for z = 3 as usually drawn. (b) The same lattice with a boundary. 


steps in the lattice. Since Hy) = —uN the noninteracting Green’s function for I = 0 is given 
below: 
Bo 
Al, ip) =- | drett tT COTO) (4.406) 
0 
(0) P aipat t l 
G (0, ip, =-| dre” (T_.C(t)C; (0)) = - 4.407 
0 ip) =- | dteP(T.C()C}) = (4.407) 


Every site has the same Green’s function, so there is no need to put on a label such as “j”. 
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The self-energy (0, ip,,) is found using perturbation theory. Since the interactions are 
simple, it is possible to find the exact expression. The self-energy is found from processes 
where an electron hops to its neighbors and then hops back. Each hop has a matrix element t. 
The total number of hops in each self-energy is an even number such as 2n. For 2n hops, 
which return the particle to the original site, the self-energy is denoted as £”. The lowest- 
order self-energy (n = 1) is from the process where a particle hops to the neighbors and then 
hops back to the original site. There are z neighbors, and the intermediate Green’s function at 
the neighboring site must be g®(0, ip,). The self-energy from double hops is 


zt? 
ip, +H 


£0, ip,) = (4.408) 


The next term in the self-energy comes from the fourth order of perturbation theory, and 
includes terms where the particle hops twice to a second-nearest neighbor, and then hops 
back. The first hop has z choices, but the second only has z — 1 since the hop back to the 
starting point has already been included. This term is 


— 1) 
£0, ip,) = E5 (4.409) 
(ip, + 4) 
A better way to write the self-energy, including both terms, is 
ADAN : zt 
Z` <e (0, ip,) = CDA (4.410) 
iP, +u- 
ipa +H 
2 
-Æ j ED (evr) 4... (4.411) 
iatu] (ipata Npn +W 


The terms in the series in brackets describe processes where, after hopping to the first 
neighbor, the particle hops to-and-from its neighbors multiple times before hopping back to 
the original site. It can only hop back to the original site once. When it hops away from the 
original site the second time, it counts as another self-energy function. Equation (4.411) 
includes terms which are higher order in the perturbation expansion. 

The factor multiplying z/? in Eq. (4.410) has the appearance of a Green’s function. It is 
the Green’s function for the particle as it hops around. The same Green’s function should be in 
its own denominator. This process leads to a continued fraction 


; zt? 
X(0, ip,,) = GD (4.412) 
ip, tu- z- DË 
ip, +u- ——— 
ID, + H— >. 


Every denominator is the same. Further terms in the continued fraction describe processes 
where the particle hops further away in the lattice. The continued fraction contains all forward 
hops to all sites. This expression is the exact self-energy. 
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The self-energy (4.410) can be solved in a simple way by defining the self-energy in the 
denominator as Xp 


(z-De 
= — 4.413 
P iP, + H — Xp i ) 
The continued fraction yields a quadratic equation which is solved easily 
0 = E? — (ip, + WEp + z- 1)? (4.414) 
Zp = 5 i, + H — y (ip, + u)? ~~ A(z — De] (4.415) 
5(0, ip) =- —— (4.416) 
Pn ip, t—Xp | 
2 
2 (4.417) 
ip, + u + y (ip, + BÝ — 4- DP 
1 
G(0, ip,) = (4.418) 


ID, + LL — X(0, ipn) 


Eq. (4.417) is the exact self-energy, and (4.418) is the exact Green’s function, for the Bethe 
lattice in the hopping model. 

The self-energy can be used to calculate the density of states. First, recall how it is 
evaluated for the tight-binding model of a regular crystal. The density of states in d 
dimensions for a crystal is 


d 
N(E) = n ; 5(E — e(k)) (4.419) 
e(k) = =t} e™ `? (4.420) 
ð 


This expression is also just the imaginary part of the retarded real space Green’s function 
Get(r, E) evaluated at r = 0: 


d¢k gik-r 


G(r, E) = | On) E — elk) + in (4.421) 
—2 Im{G(r = 0, E)} = 27 | atk 8(E — &(k)) (4.422) 
| (2n)" | 


= 2nN(E) (4.423) 
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For the Bethe lattice, the above Green’s function is in real space at lattice spacing equal to 
zero. Obtaining the retarded function (ip, +p — E + in), and taking its imaginary part gives 


TN (E) = —Im[G,.(0, E)| = -m| E or (4.424) 
2 [F2 _ F2 

nN(E) = NE r O(E? — E’) (4.425) 

E? = 2z? (4.426) 

E = 4z- 1)? (4.427) 


It is useful to renormalize the energy by Q = E/(zt) and to write the result in dimensionless 
form 


P(Q) = 2ntN(E) (4.428) 

2 2 
-VS OAR- 4.429 
=* Tr 0 - 2%) (4.429) 
g = “= 9) (4.430) 

Z 
One can verify that for z = 2 then Q) = 1 and the density of states is 
1- Q? 

P(Q) = OU = 2) (4.431) 


vi- 9? 
This result is identical to that of a particle on a one-dimensional chain with nearest neighbor 
hopping. Then its energy is €, = —2tcos(k) and the density of states for this dispersion is the 
inverse square root function given above. The theory gives the exact result for a chain of 
atoms, when z = 2. For larger values of z > 2 then Q < 1. In these cases the denominator 
1 — Q? is never zero. The function p(Q) is shown in Fig. 4.13 for the cases z = 4 and z = 6. 
The density of states has a smooth shape over a continuous band of energies. The offsite 
Green’s function is evaluated in the homework problems. 


4.4.2. Ising Model 


The Ising model for a lattice of spins is derived in Chapter 1. The usual case is to have 
spin one-half particles at each lattice site. Their z component of spin can point either up or 
down, which is called o = +1. The spins on neighboring sites (ij) interact by —Jo;0,. The 
Ising model with a magnetic field can be solved exactly, analytically, for a one-dimensional 
chain. In two dimensions it can be solved exactly without a magnetic field, as first done by 
Onsager. It is interesting that it can be solved exactly for the Bethe lattice. The Hamiltonian is 

H = -J $ o;o; — h} 9; (4.432) 
(ij) J 
where h = H is the Zeeman energy in a magnetic field. 

The parameter z is the coordination number. It is the number of first neighbors for each 
site. A related parameter is r = z — 1 is called the “branching ratio”. When a particle comes 
to a site, it has r choices of paths to move forward. Here the particles are not moving, but only 
their spins interact. Figure 4.12(a) shows a Bethe lattice with r = 2. The same lattice will be 
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FIGURE 4.13 Normalized density of states for the Bethe lattice for z = 3, z = 4, and z = 6. 


drawn in a different way in Fig. 4.12(b). If the lattice is composed of sticks with hinges at the 
vertices, then if it is picked up at one point, all of the sticks will fall, thereby producing the 
version of the lattice in Fig. 4.12(b). This version is convenient for solving the Ising model. 
The bottom of the figure is called the “boundary” of the lattice. The partition function is 
obtained by averaging the spins at the boundary, and then moving inward row-by-row. 

The first step is to average all of the spins in the j = 0 row. Each spin in the j = 1 row is 
connected to r spins in the j = 0 row. Let o denote a spin in the j = 1 row, and let 
(61, 05,..., 6,) denote the r spins connected to it in the j = 0 row. Averaging just this small 
complex of spins gives a contribution to the partition function 


2, = exp{BJo(o, + 0) +---+0,)+ BAC +o +0, +---+96,)} 


O,:°0,=+1 


= e”9[2 cosh(BJo + BAY (4.433) 


Each spin has an effective interaction with its r neighbors in the lower row of 
o[h+J(o, +---+6,)]. In the nonmagnetic state, the sum over the neighbors will average 
out to a small number. In the magnetic state, the sum over the neighbors could add up to +r. It 
is useful to define the effective magnetic energy h, which acts upon the spins in row j. A 
shorthand notation will be that B, = Bh,. For the first row there are no spins below, so that 
hy = h, By = Bh. For the next row, Eq. (4.433) is used to define B, as 


A,e®'> = eF'°[2 cosh(By + BJO (4.434) 


Sec. 4.4 e Bethe Lattice 253 


Setting o = +1 gives two equations which are solved for the two unknowns (A,, B,): 


Ae! = e”[2 cosh(By + BJ)’ (4.435) 

A,e*! = e "12 cosh(By — BJ)” (4.436) 
E r_ | cosh(By + BJ) 

B, = Bh + E ET BG A (4.437) 

A, = 2”[cosh(By + BJ) cosh(By — BDI? (4.438) 


The factor of By is inserted instead of Bh in the argument of the hyperbolic cosines since the 
effective field is associated with the spins on the lower row. The factor of BA to the right of the 
equal sign in (4.437) is from the energy of the spin in row j = 1. The formula (4.437) can be 
manipulated by taking 


cosh(By + BJ) _ cosh(By) cosh(B/) + sinh(By) sin(B/) 


cosh(B, — BJ) —_ cosh(By) cosh(B/) — sinh(By) sin(B/) (4.439) 
mli = = 2 tanh”! x (4.441) 

These identities combine to give the result 
B, = Bh +r tanh '[tanh(B,) tanh(B/)] (4.442) 


The second step is to average the spins in the row j = | to give the effective field B, = Bh, in 
the next row. The procedure is exactly the same. The general recursion relation, as the rows 
are averaged one-by-one, is 


Bi, = Bh +r tanh™'[tanh(B,) tanh(B/)] (4.443) 


As the spins are averaged, row-by-row, the effective field B, = Bh; converges to the value in 
the interior of the Bethe lattice. This bulk value is denoted as B*. It obeys the self-consistent 
nonlinear equation 


B* = Bh +r tanh '[tanh(B*) tanh(BJ)] (4.444) 
tanh[(B* — Bh)/r] = tanh(B*) tanh(B/) (4.445) 


The solutions to this equation describe the collective states of the Ising model on the Bethe 
lattice. 

The first example is to assume: (i) zero magnetic field (h = 0), and ferromagnetic 
coupling (J > 0). The transition temperature T,, (B. = 1/k,T.), is where the ordering begins 
as one lowers the temperature. At the transition temperature the order parameter B* is zero, 
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and it increases in value as the temperature is lowered. It has an infinitesimal value at a 
temperature which is infinitesimally smaller than T.. In this case the above equation is 


B* = rB* tanh(B,J) (4.446) 
tanh(B 7) = l (4.447) 
kgT, = aan (4.448) 
n( ) 
r— i 
K, =BJ= zh + z) (4.449) 
2 r— iI 


The case for r = 1 is a one-dimensional chain. In this case T, = 0 and there is no ordered 
state at nonzero temperature. For all other branching ratios r > 1 there is a well-defined 
transition temperature. 

It is interesting to compare these results of the Bethe lattice with the results for the Ising 
model on crystalline lattices. What is actually tabulated in Table 4.1 is the quantity K, for 
various values of r. Exact results are also shown for crystalline lattices in two and three 
dimensions (Mahan and Claro, 1977). The lattices are: honeycomb (hc), square (sq), plane- 
triangular (pt), simple-cubic (sc), body-centered-cubic (bcc) and face-centered-cubic (fcc). 
The value of K, for the Bethe lattice is typically below the crystalline result by 10-20%. 
However, the Bethe lattice has the correct trend that K, decreases, and T, increases, as the 
branching ratio r increases. The crystalline results are analytically known in two dimensions, 
since the Ising model can be solved exactly. For example, K, = + In(2 + /3) for the hc lattice 
and K, = 4In(1 + v2) for the sq lattice. 

Another solution to Eq. (4.444) is to solve for the limit that the temperature goes to zero 
when the magnetic field is zero. In the limit that K = BJ — oo the equation to solve is 


tanh(B* /r) = tanh(B*) tanh(K) (4.450) 


In the limit that the argument of the hyperbolic tangent function becomes large, its value 
approaches one. The limit is 


eK — eK 


li — —____ = | —2¢~*K —4K 4.451 
gm tanh(K) Kiek e“ + Ole) ( ) 


TABLE 4.1 Ising model ferromagnetic 
transition termperatures (K, = J/kgT,). The 
dimension is d and r is the branching ratio. 
The column K, is the exact crystalline result, 
while K,(B) is the result from Bethe lattice 


d Crystal r K, K,(B) 
2 he 2 0.659 0.549 
2 sq 3 0.441 0.347 
2 pt 5 0.274 0.203 
3 sc 5 0.222 0.203 
3 bec 7 0.157 0.144 
3 fcc 11 0.102 0.091 


Sec. 4.4 e Bethe Lattice 255 


so that (4.450) becomes 


e 2B Ir — 2B 4 92K (4.452) 
B* ~ rK (4.453) 
h* = rJ (4.454) 


where B* = Bh*. The solution to (4.450) is that the effective local field from the r spins in the 
row below is just h* = rJ. At zero temperature, all of the spins are aligned in the ferro- 
magnetic arrangement. 

The derivation at the beginning of the section began by trying to evaluate the partition 
function. It was used to derive a self-consistent equation for the order parameter B*. Now 
return to the task of finding the partition function. The first step is to count the number of 
spins in the Bethe lattice. The counting is done using Fig. 4.12(b). The top row (j = L) has 
one spin. The next row down has z spins. The next row below that has zr. Successive rows 
have zr’. The boundary row has (j = 0) has Nọ =zr’~'. The number in each row is 
N, = zr’! except N; = 1. The total number of spins is N = )° N; = 1 +2(r" — 1)/(r — 1). 

The partition function was averaged for a cluster of r spins in the boundary row. The 
number of such clusters is N,. So the total partition function after averaging the first row is 


N; 
Z = Ay! exp|B > a (4.455) 
i=l 
N, Ni 
= A" exp] B, 3 o; (4.456) 
Ay = Ai! " = 2[cosh(By + BJ) cosh(By — Bry? (4.457) 


This process is repeated for the second row (j = 1) which gives the result, including the first 
two rows, of 


Ny 

Z, = Ly}! exp Ẹ y a (4.458) 
i=] 

A, = 2[cosh(B, + BJ) cosh(B, — BRI- (4.459) 


This process is repeated for the subsequent rows. The important contribution to the partition 
function for each row is the factor of ./,’. The final partition function is 


Z =A = A (4.460) 
f* = 2[cosh(B* + BJ) cosh(B* — BJ)! (4.461) 
In the last equation, the bulk partition function is assumed to be independent of edge effects. 


The value of the order parameter for the bulk (B*) is used in the definition of ./*. That is, 
assume that B, rises to its bulk value after a few rows, so that the bulk value is relevent for the 
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majority of the spin sites. The expression for .7* can be shortened. The quantity in brackets is 
(where K = BJ): 


cosh(B* + K) cosh(B* — K) = cosh?(B*) cosh?(K) — sinh*(B*) sinh?(K) 

= cosh?(B*) cosh?(K)[1 — tanh?(B*) tanh?(K)] 
= cosh*(B*) cosh?(K)[1 — tanh?((B* — Bh)/r)] 
_ cosh?(B*) cosh?(K) 

= cosh*[(B* — Bh)/r] 

_ , cosh(B*) cosh(K) 

~~ cosh{(B* — Bh)/r] 


(4.462) 


* 


(4.463) 


where Eq. (4.444) is used to simplify the expression. 

The ferromagnetic system has a sharp phase transition only at zero magnetic field 
(h = 0). In this case B* = 0 above the transition temperature, and then ./* = 2 cosh(K). The 
expression (4.463) gives the formula in the ordered state by setting = 0. In the case of 
nonzero magnetic field (h 4 0), the ordering of the spins is gradual as the temperature is 
lowered. Even at high temperatures, the parameter B* does not vanish but approaches 
B* — Bh. In this case one should use (4.463) with all of its various factors. 


4.5. TOMONAGA MODEL 


The Tomonaga model (Tomonaga, 1950) describes a one-dimensional electron gas. The 
procedure is to examine the Hamiltonian of the one-dimensional electron gas and make some 
approximations on it. As a consequence of these approximations, the Hamiltonian becomes 
exactly solvable. The one-dimensional electron gas is not exactly solvable but only an 
approximate version of it. 

The important physics is the recognition that the excitations of the electron gas are 
approximate bosons, although the elementary particles, electrons, are fermions. The excita- 
tions involve two-particle states, for example, moving an electron from one state to another. 
The wave function of the two fermion states has boson properties. The Tomonaga model 
assumes that the excitations are exactly bosons, which is the important approximation. 

The model has been useful in several kinds of problems. First, there are organic solids 
such as TTF-TCNQ whose conductivity is thought to be largely one dimensional (see 
Heeger, 1977). The Tomonaga model has played a role in the interpretation of electrical 
conductivity in these materials (see Luther and Emery, 1974). Second, in impurity problems, 
or X-ray absorption problems, the response of the electron gas to the central impulse can be 
factored into spherical harmonics associated with different angular momentum states /. Each 
angular momentum channel / then becomes a one-dimensional electron gas to which one may 
apply the Tomonaga model. Recently, semiconductor nanotechnology permits the construc- 
tion of semiconductor channels which act as one-dimensional conductors. The Tomonaga 
model is used in the theory of these systems. Single wall carbon nanotubes are another one- 
dimensional conductor. 
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4.5.1. Tomonaga Model 


The original model of Tomonaga (1950) discusses the following Hamiltonian for the 
one-dimensional interacting electron gas: 


1 
H = vp © [elastin +57 2 VPP) (4.464) 
p(k) = x4, —k/2,s%ptk/2,s (4.465) 


The system has length L, and vp is the Fermi velocity of the particles, which are assumed to 
have a linear dispersion relation. The label s = +1 denotes spin, and p(k) is the electron 
density operator. The electron—electron interaction term V, will be specified below. It is not 
4ne*/k*, which is dimensionally incorrect in one dimension, since V, has units of Joule- 
meter. Dimensional analysis suggests the form V, œ e?(k,/k)", where n is any exponent. The 
summation over k states may be turned into integrals by the usual transformation as L — oo: 


ESO = 5 | dw (4.466) 

The basic step in the Tomonaga model is to divide the density operator into two terms: 
p(k) = Ro Te, sAp-+k/2,s (4.467) 

P2(k) = Ra ya, s“p+k/2,s (4.468) 

p(k) = p + P2(k) (4.469) 


The density operator p(k) commutes with any other density operator p(k’). However, the two 
parts p, and p, do not commute with the same parts for other wave vectors. Examine the 
commutation relations: 


[p:(k), pik’) = > X la, À u2 sāprkj2s Aye sape 2s] 
5,5’ p,p'> 


= D lap y2sapik +4 2,508 + k/2 + k'/2) 
S,p> 


— a’ p r2 s4p44/2,s0@ — k/2 — k/2)] (4.470) 
An important special case is k’ = —k 


[p,(k), pı(—k)] = 2 Ip —~k/2,s T n+k/2,5] =), » Ny s (4.471) 
s,p>0 S —k/2<p<k/2 


The right-hand side shows that the commutation relations depend on the operator n,, over a 
range of p values. The operator n,, is replaced by its average in the ground state of the free- 


particle system. 


2(kKL/2n), k< 2Kp 


ns =2 EE tn k > Up 


P 
$ —k/2<p<k/2 —k/2<p<k/2 
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and the commutation relations (4.471) can be written for k < 2k, as 


ipi (K), pi (K) I= (= *) (4.472) 
lok), p(— H)=-(=) (4.473) 
(014), p(—k)] = 0 (4.474) 


The analogous results are included for the other commutators, which can be derived in the 
same fashion. The Tomonaga model assumes that these density operators obey the exact 
commutation relations of 


k 
[pi pH) = Sre (2) 


[p2(k), pa(k] = -òk x (=) (4.475) 


[pı (k), Po(—k’)] = 0 


These relations are the central approximation of the Tomonaga model. The commutation 
relations are not exact, since the commutators give operators, as in (4.470). However, these 
results are obtained when taking the expectation value of the exact commutation relations. For 
example, in (4.470) 


(ipi (k), pi) = D(a -4/2,5p+¥' +42.) 1 + k/2 + k'/2) 
s.p> 


= (a_u r2 sđp4k/2s)O(P — k/2 — k'/2)] (4.476) 
In the right-hand side, the averages are zero unless k’ = —k, so that 


(Epi (k), pi (K) = Ske 2 Lin Ny—1/2) — (Apk) = 20k 2, (Mps) (4.477) 
s,p>0 —k/2<p<k/2 


Although the commutation relations (4.475) are not exact, the expectation values of these 
commutators are given exactly. The approximation is not a very bad one. 

It is convenient to express the density operators p;(+ķ) in terms of creation and 
destruction operators. This step is done so that the creation operators are dimensionless and 
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the commutation relations (4.475) are obeyed. The creation and destruction operators are for 
bosons. These definitions are given below, where the symbol k is always positive: 


kL 

pı&) = hj” 
kL 

pı(~4) = j” 


z7 (4.478) 
palk) =b y/— 
kL 
P2(—k) = baj 
(by, bL] = Sky (4.479) 


When & is positive p; (k) « b}, and when k is negative p; (k) « bt ,. Lhe operators p; always 
commute with p,. The choice (4.478) does satisfy the approximate commutation relations 
(4.475). 

The second term in the Hamiltonian (4.465) may be written in terms of these boson 
operators: 


7 VOW P-K = E Peb + OL NEL + 6-1) (4.480) 
7, = we (4.481) 


The electron—electron interaction term has been recast into an interaction between the boson 
excitations of the electron gas. 

The first term in (4.465) is the particle kinetic energy. It requires some additional work in 
order to express it in terms of boson coordinates. It is not immediately obvious how to express 
ala; in terms of the new boson operators. When faced with this predicament, it is useful to 
examine the commutation relations of this operator. The objective is to find a boson repre- 
sentation of the kinetic energy operator which reproduces the commutation relations. If this 
cannot be done exactly, at least try to find a good approximation. The commutator algebra 
completely specifies the excitation spectrum of the system, so that the excitations are 
adequately described by operators with accurate commutation relations. 

Call the kinetic energy term Hp. Its commutator with p,() is 


[p:(k), Ho] = vr DO 2 Ik Ilat _k/2,5%p+k/2,9 ays] 
S,p>0 s'k’ 


=o D 79 s4p4/2,5(IP + k/2| — Ip — k/21) (4.482) 
s,p> 
k ifp>k/2 


k/2| — Ip — k/2| = . 
Ip+k/2| — |p — k/2| | if p < k/2 (4.483) 
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For small values of k, then p > k/2 over most of the p summation. In this case the above 
commutator is approximately given by 


[P1 (k), Ho] = vrk 2 al k/2,54p+k/2,s = vrkp (k) (4.484) 


S,p> 


The above is a desirable form for the commutator, since the right-hand side 1s also propor- 
tional to p,(k). With the boson representations (4.475) and (4.478) for pı (x), the approximate 
commutation relation (4.484) is 


[b;, Hol = kurb; = Ob; (4.485) 


Of course, the same result would be given by the choice of Hy) = >, (ob! by. Next consider 
the commutator of Hy) with p,. The same approximation in this case leads to 


[p2(k), Ho] = —@;P2(*) (4.486) 


Both of these approximate commutators are satisfied with the following choice for Ho: 
Hy = 920, bib, (4.487) 
k 


H= Yorpbibs + V,(b, +b! (bt + b) (4.488) 


The one-dimensional electron gas (4.465) has been recast into the boson Hamiltonian (4.488). 
The latter is exactly solvable, as will soon be shown. The Tomonaga model (4.488) has been 
derived from (4.465) with several key approximations on commutation relations. The form 
(4.488) is a description of the boson excitations of the electron gas. 

Equation (4.488) may be solved exactly by a variety of techniques. Probably the easiest 
method is to change to a coordinate representation for the boson operators: 


1 


Q, = a +b",) (4.489) 
k 
P, =i So} ~b_,) (4.490) 
(Ox, Pe] = 1044 (4.491) 


In this representation the Hamiltonian is written as 


Ho =3 DPP, + 00,04) (4.492) 
H = 2 DOPP, + EZO,Q x) (4.493) 


Ez = 0} + 40V; (4.494) 
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The new eigenfrequencies are E,. Now change back to a new set of boson normal mode 
operators, which are normalized to the new eigenfrequencies. 


1 
Q, = oe +a.) (4.495) 
k 
E 
P.=i 5 (ay — a) (4.496) 
[a aL] = 1h y (4.497) 
H= D Eloko +3) (4.498) 


These series of steps may be summarized by the observation that the boson operators are 
changed in the following way: 


(A) 
b, +b, = [a 0r +a, (4.499) 
k 


JE 
bl -b= p — a) (4.500) 


These transformations are useful for other problems. 

The Hamiltonian of the one-dimensional electron gas (4.465) has been solved 
approximately. Only the excitation spectrum has been obtained. Some of these excitations are 
fluctuations in the density operator p(k). Very similar results to the Tomonaga model are 
obtained by writing an equation of motion for the density operator and solving it approxi- 
mately. This approach is used in Chapter 5. 

So far the form of the interaction potential V, has not been specified. In fact, physicists 
choose a variety of forms for this interaction to suit their problem. The units of V, are the 
same as vp: Joule-meter (Avy is J-m). One possible choice is to take V, « e = constant = V}. 
The energy spectrum is just altered by having the Fermi velocity increased: 


E, = vpk (4.501) 


| 2 
Vr = UF (vr + m ro) (4.502) 


The constant V} is assumed to be positive, since it describes interactions between electrons. 
The interactions increase the velocity of the acoustic plasmon. 

Another possible choice is to take V, = 2/3(e*k2/k*). This choice leads to long- 
wavelength modes with a constant frequency, which is the plasma frequency: 


E; = k? vp + 02 (4.503) 


- Ane*n 
o, = 40,V;, = —— (4.504) 
where ny = k} /3n? is not the electron density in one dimension, but is a collection of 
constants. In the electron gas, there are two different types of excitations. One is the plasma 
modes at long wavelength, and the other is the electron—hole excitations at shorter wave- 
length. The latter are probably best described by the choice V, = Vo. 
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4.5.2. Spin Waves 


The Hamiltonian (4.464) of the one-dimensional electron gas has other collective 
excitations besides the density oscillations which were discussed above. These other exci- 
tations have the character of spin waves, or magnons. Overhauser (1965) has shown that the 
excitation spectrum is completely described by the sum of these two types of excitations: 
density oscillations and spin waves. This feature of one dimension does not apply to three 
dimensions. The density oscillations are the excitations which occur when there are external 
perturbations such as electric fields. The spin waves respond to magnetic perturbations and 
contribute to the spin susceptibility. 

The spin waves are described by the operators 


o(k) = 6, (k) + o(k) (4.505) 

0\(k) = 2 Sal 479 Ay tk/2,8 (4.506) 
p>0,s 

o(k) = 2 $a) 4/2,54p+k/2,5 (4.507) 
p<90,s 


where the spin index is s = +1 for fî, |. The nature of the spin wave excitations is shown in 
Fig. 4.14. The spin-up and spin-down densities have opposite variations, so there is no net 
change in the particle density. There is a variation in Pr— Py 


Ps(k) = E a} 470,54 +42, (4.508) 
P 

P=P, + Py (4.509) 

o =P; Py (4.510) 


The spin operators are examined in the same fashion used for the density operators. The 
commutation relations are found among these operators and between them and the density 
operators. Some typical results are 


, kL 
[91 (k), 0)(—K )] = òw (=) (4.511) 
; kL 
[o2(k), O2(—K’)] = —ôz w (=) (4.512) 
[o (k), 62(k’)] = 0 (4.513) 
[o;(k), p;(—k’)] = 0 (i, 7 = 1, 2) (4.514) 
The commutator [o;(4), p;(k)] contains one factor of s, and the term s = 1 cancels s = —1. 


This cancellation occurs when the two spin states are occupied with equal probability and the 
system is not magnetic. The spin operators commute with the density operators and so 


pt ON SN 
Pt+ py =O 
- #0 
P, N Pt- Pi 


FIGURE 4.14 Spin-up and spin-down charge densities. 
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describe an independent set of excitations. These excitations can be represented by a new set 
of creation and destruction operators, which for k > 0 are 


01(k) = en 
o,(—k) = dl 


(4.515) 

kL 

0(k) = tale 

kL 

6,(—k) = caj 
[cp cL] = Bee (4.516) 
[c,, bi] = 0 (4.517) 


The next step is to examine the commutation relation of o,(k) with the Hamiltonian (4.465), 
which will establish the energy spectrum of these spin wave operators. They commute with 
the second term in (4.465), from electron—electron interactions, since they commute with the 
density operators. The commutator with the kinetic energy term Hp is: 


[o\(k), Ho] = vF 2 sa) _4/2,s4p+k/2,sl1P + k/2| — |p — k/2|] (4.518) 
S,p> 
xX vpko,(k) (4.519) 
The commutator is evaluated using the same approximation to get (4.484). Exactly the same 


result is obtained by representing the spin wave part of Hy by >>, w,ch cy. The spin wave part 
of the Hamiltonian is 


H,,, =Y oclc, (4.520) 
k 
_ t a = tagt 
H = $ {0;b,br + Oz, CO, Cy + V(b, + b (b; + b_,)} (4.521) 
k 
H = Y {Eala + oclc} (4.522) 
k 


The density operator parts in (4.488), (4.522), and (4.498) are combined with the spin wave 
parts to give the total Hamiltonian H for the excitation spectra of the one-dimensional 
electron gas. The original model of Tomonaga actually described a spinless electron gas. For 
spin one-half systems, the two possible spin orientations lead to another type of independent 
excitation which are called spin waves. The total Hamiltonian (4.522) has the density and spin 
wave excitations decoupled. 

The original Hamiltonian (4.465) did not contain any terms which would cause inter- 
actions between spin waves; there were no terms of the type o(k)o(—k). The spin wave 
excitation spectrum is unchanged by electron—electron interactions, at least in the Tomonaga 
model. 
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The spin wave part of the excitation spectrum can be used to derive the Pauli spin 
susceptibility. The starting point for this calculation is (3.458): 


B 
y(k, io) = -| dte™ (T .o(k, t)o(—k, 0)) (4.523) 
0 


In the Tomonaga model, the correlation function may be evaluated exactly by using the 
operator representation (4.515): 


o(—k, 0) = oi (—k) + o (—k) = (=) (cl +c) (4.524) 
o(k, t) = (=) (cl ,e™ +c,e '%) (4.525) 


The t dependence is determined by H,,,. The further steps in the evaluation of the correlation 
function are identical to the derivation of the unperturbed phonon Green’s function in (3.76): 


=) 2, 
k, io) = | — | —s 4.526 
x(k, iw) ( T J (iw) — a? l ) 
Ik|L 20, 
k,@) = —) 4.527 
Xalk, ©) ( T Ja — 0% + 1205 (4.527) 


The retarded correlation function is found from the analytical continuation iœ > œ + ið. 

The susceptibility is found to be proportional to the length L of the electron gas. This 
dependence on L is correct, since the susceptibility is the total magnetization M divided by 
the magnetic field, and the total magnetization is indeed proportional to the size of the system. 
A more meaningful quantity would be the magnetization per unit volume, which is the above 
result divided by L. The susceptibility demonstrates a resonance phenomenon, so that it is 
singular whenever the external perturbations (k, œ) exactly match those of the excitation 
spectrum © = ©, = kup. 


4.5.3. Luttinger Model 


A model proposed by Luttinger (1963) is a slight variation on the Tomonaga model. It 
has the advantage of being exactly solvable, with fewer approximations, yet is identical to the 
Tomonaga model in some of its essential properties. The basic feature of the Luttinger model 
is that the system has two types of fermions. One has an energy spectrum given by €, = kvp, 
while the other has an energy spectrum given by €, = —kvp. They are shown by the solid and 
dashed lines in Fig. 4.15(a). There is an infinite number of each kind of particle, since the 
occupied energy states stretch to negative infinity. 

In the Tomonaga model (4.465) it is assumed the energy spectrum is as shown in Fig. 
4.15(b). The particles have a linear dispersion relation, but the same kind of particle is 
represented throughout the band of states. 

The two kinds of fermions in the Luttinger model are denoted by the operators a, s and 
Ay k s , Where the subscript 1 or 2 designates the particle. The two bands are quite independent, 
so the two fermion operators anticommute: 


(ai 4.5.4) p y} = Sy Pie Dos (4.528) 
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(b) 


FIGURE 4.15 (a) The Luttinger model has two distinct particles, with separate energy bands. (b) The Tomonaga 
model has one particle, whose energy band is v-|k]. 


The operators p,(k) and o,(k) are defined as in the Tomonaga model (p > 0): 


pi) = > al pyp siks (4.529) 
p(—p) = E di sikip = p,(p)' (4.530) 
c;(p) = E SA ktp sfiks (4.531) 
s= al, sdi kps = FP)! (4.532) 


The advantage of the Luttinger model is that it has the same kind of commutation relations as 
found for the Tomonaga model. However, they are valid for all p, whereas they were valid 
only for p < 2kp in the Tomonaga model: 


[PiP P = py (7) (4.533) 
(7210). PaP] = By (=) (4.534) 
[P P), P2(p")] = 0 (4.535) 
(o1(-P). 11 = 8,» (2) (4.536) 
[o,(—p), o2(p’)] = Ô, p z) (4.537) 
[oi (p), 52(p')] = 0 (4.538) 
[5,(p), pp = 0 (4.539) 


These commutation relations depend, in an important way, on the assumption that there is an 
infinite number of negative-energy particles. For example, the first commutator is 


[p,(—p), PEN = 255,p/ 2 (mg — N k+p) (4.540) 
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The factor of 2 comes from the summation over the two spin configurations s = +1. For a 
finite number of particles, each summation over particle number would just give the number 
of 1-particles N,, 


N, =—2 2 Mig = 2 LM ktp (4.541) 


and the commutator would be zero. However, when there is an infinite number of particles in 
negative-energy states, a nonzero result is obtained. For a finite band, the difference 


2M = Ni k4p) (4.542) 


equals pL/2r at the top end of the band, but it equals the negative of this at the bottom end of 
the band, so that there is no net difference. For a semi-infinite band, there is no bottom 
contribution, so only the top difference is counted. 

The kinetic energy term in the Luttinger model is 


Hy = vp 2 kla} psa ks — d} y Ar ks) (4.543) 


Hy has the exact commutation relations with the operators (p > 0) 


[Ho, pı (p)] = vrppi p) [Ho, P2P)] = —vrpp:p) (4.544) 
[Hy, O;(p)] = vrpo (p) [Hp, 52(p)| = —vppo2(p) (4.545) 


The kinetic energy term is exactly represented by the operator 
TU 
Ho = DA 2 lP: P)Pi CP) + P2(—p)P2(p) + 9)(p)o,(—p) + 52(—p)o2(P)] (4.546) 
p> 


In the Tomonaga model, the boson approximation applies only for excitation with small k. 
This restriction is removed in the Luttinger model. The transformation to boson operators is 


[pL t [pe 

P\(—p) = bip w’ pı(p) = z 
L L 
Pp) =b} p5, on f 


(4.547) 
[pL [pL 
O(P) = cip w’ o(p) = c}, p 
pL pL 
aP) =d a P) = cr 
The Hamiltonian is now 
=L purlbi Dip +b, „ba -p + Ch ,C1p + C2, pea, (4.548) 
p> 


The operator p,(p) for p > 0 takes a particle from state k and puts it into p +k. This 
operation will make an electron-hole pair when k < kp and p + k > kp. The summation over 
all such electron—hole pairs is represented by the boson creation operator bi, . For particle 2, 
the Fermi “surface” is at the negative wave vector —kp. Electron-hole pairs are made mostly 
at negative wave vectors. The operator p,(—p) = >-, al a, k+p for p > 0 creates these 
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bosons, since it takes an electron from the occupied state kp < k + p to the unoccupied state 
k < —kp, where k is negative. 

Various kinds of interaction terms may be added to the Luttinger model. Those which 
arise from electron—electron interactions are expressed as the product of four fermion 
operators, or two density operators. These Hamiltonians are exactly solvable, since they 
describes linear coupling between two harmonic oscillator systems. 

The Luttinger model has the advantage of being exactly solvable. Of course, one could 
add other terms which might render it no longer exactly solvable. The disadvantage of the 
model is that it is unphysical, since it contains the infinite reservoir of negative-energy 
particles. 


4.5.4. Single-Particle Properties 


Some of the most interesting applications of the Tomonaga—Luttinger models are 
concerned with single-particle properties of the electron gas. An important quantity is the 
occupation number n; s = (a; k.s4i,k,s)> In the interacting system. A more ambitious calcu- 
lation would be the one-particle Green’s function 


G; (k, t) = —i(Ta;, ,(t)a! , ,(0)) (4.549) 


To obtain these quantities requires a representation of the single fermion operator a;,, in 
terms of boson operators. The discussion follows Mattis and Lieb (1965) and Luther and 
Peschel (1974). 

The representation of the single-fermion operator in terms of bosons is found, as always, 
by examining the commutation relations. A representation of a; z s is satisfactory if it obeys 
all the proper commutation relations with the other operators. The first step is to Fourier- 
transform into a real-space representation: 


1 . 
Y (x) = W 3 a (4.550) 
1 , 
Pİ (x) = — e~a} ; (4.551) 
ia vk 


The advantage of this representation becomes clear when considering the commutator of 
Y,,(x) with the density operators. This discussion uses the Luttinger form of the Tomonaga 
model. Typical commutators are 


[‘Y is(X), p;(p)] = Sje P a) (4.552) 
[Fs 0), op] = Syse Pya) (4.553) 
which is derived in the following way: 


I i 
(Fis). PP) = FeO e laiks D wapres] 


5. . 

— LJ ikx 

= JL e Aj ks Oss Òk=k'+p 
L kks! 


5 oOo, | 
= g” 2 elk “Ai ks = 6; je” Px) (4.554) 


268 Chap. 4 e Exactly Solvable Models 


The commutator [‘Y;,(x), p;(p)] has a simple form, since it is just proportional to Y, (x). The 
solution would be simpler if the commutator were a constant or even proportional to a density 
operator. It is not, so the solution of (4.552) is more complicated. One possible solution has 
the form 

Yi s(x) = Fy (x) expli e)l (4.555) 


lo _. , 
KE) =-7 E P sop- lpp) +solp) (4.556) 
p> 


The prefactor F, (x) can be a function of x but is a c number in the sense that it must commute 
with both p; (p) and o(p). Next show that this choice does satisfy Eq. (4.552) 


[Pis Pil = Flep — pie") = F (ep e” — 1e 
= Fy, pile” =. Pi] Yis (4.557) 


The last line is valid only when the commutator [J,, pı] is a c number which commutes with 
the operator J,. It does for the Ji (x) in (4.556): 


T l i ipx 
Mi PiP) =7 ze IPC), pp = e” (4.558) 
The next observation is that the factor 


pip) + s0,(p) = vd +ss')a pay yi by = 2 2 a! yapsak (4.559) 


since the factor (1 + ss’) = 0 unless s = s’, and then it is 2. The equations can be condensed 
by introducing the notation of a spin-dependent density operator (p > 0): 


Pal) = al kp si,k,s (4.560) 

P= E al Ais kaps (4.561) 

Pip) = E pis(P) (4.562) 

op) = E spP) (4.563) 

KO = — pt) (4.564) 
all p 

P(x) = F expli] (4.565) 


The spin-dependent density operators can be represented by boson operators similar to 
(4.547) with an additional spin subscript. 

The form of Y,,(x) in (4.556) is a solution to the commutator equation (4.552). 
Unfortunately this solution has some undesirable properties which will force a modification. 
The need for changes in Y,(x) may be understood by examining the form of the operator 
| CAREA REA )) for the noninteracting electron system, which is the Luttinger model with just 
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the Hamiltonian Hp in (4.543). At zero temperature, the noninteracting system has the feature 
that the momentum distributions for particles 1 and 2 have the form 


nirs = ke — k) (4.566) 
Nd ks = O(kr + k) (4.567) 


This fact can be used to evaluate the correlation function (PT OFE )) by using the inverse 
of the transformation (4.550): 


l ek’ 
(POPLE) = F eM (ah aes) (4.568) 
kk’ 
1 eke 
=- Se MOPS Mn ks (4.569) 
Li " 
© dk _, 
_ | eH) Okie — K) (4.570) 
wt OFO a“ 4.571 
(P,Q) Fis )) = ~ Qni(x —x + in) (4.571) 
i , eikr(x-x) 
(‘P5.(x) P25(x )) = s>——_= (4.572) 


— 2ni(x — x — in) 


where the factors of +in are added to aid convergence at infinity. 

The objective in choosing the representation (4.565) is to make the result for 
(PT (x) (x )) be like (4.571) for the noninteracting Luttinger model. A method of doing 
this was suggested by Luther and Peschel (1974). It uses a limiting process, where the wave 
function ‘P,,(x) contains a parameter a, and the limit x — 0 is taken at the end of the 
calculation. Including the parameter «, the position space operators are represented as 


1 
Y (x) = ina explik-x + J,,(«, x)] (4.573) 
1 
Y, (x) = Srna exp[—ikpx — J>,(a, x)] (4.574) 
—ak/2 
Jie) = — FO o- eB) (4.575) 
—_ (a,x) (4.576) 


This new form for J,,(a, x) may be expressed in terms of the boson operators: 
p>0 


2 , . 
Jis(4,x) = do e=? Oie” 7 bis pe”) (4.577) 


2 , , 
hla = E P | Ope — bas ape) (4.578) 


p>0 
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The exponential factor J, ,(«, x) has the same form as (4.556) in the limit where « — 0, so the 
commutator (4.552) is obeyed in this limit. The prefactor 1/./2na is explained below. Now 
consider the evaluation of the quantity 


/ l —i x— = X, X a 
(PT OFE )) = xe Kp(x—x’) (e J s(0,) ois x) (4.579) 
1 , /2 
— sage ofen eae “(eo ), _ ba) 


[2T ay | 
x exp ew ? “(el bisp -e bi ol) (4.580) 


The right-hand side of this expression is an average of exponential functions of boson 
operators. These expressions are evaluated in Sec. 4.3.2. Each exponent is separated by using 
the Feynman theorem exp(4 + B) = exp(A) exp(B) exp(—[A, B]/2). Then the factors are 
commuted until all the destruction operators are on the right: 


/ | —ikp(x—x’)—O 9 (x— —o 2n Li Li 
FOYE) = Srat fr) Oo tae e *P Ibi (e ke J 


kL 
x exp| en %#/2 27 b ike’ _ „ikx 
pje kL 1s,4(€ e”) 


-ak | œ d , 
= (1 —e’*) = | eet — e'*) (4.582) 
0 


21 
pox) = T% 


At zero temperature, the quantity in the final brackets gives unity, which gives the following 
prediction for the noninteracting electron gas: 


1 , 1 / 
yt Y Ay — p`ikr&-x )—þox—x') 4.583 
FLOP) =s—e (4.583) 
The expression for þọ(x) has the form of an infrared divergence as discussed in Sec. 9.3. 


Expand the exponential exp(ikx) and integrate term by term. The factor exp(—«k) ensures the 
convergence of these integrals, which is the primary role played by g: 


© (ix) (ak o (ix)! 
PoC) =— Da] dkk ee = — Dr 
I=0 t Jo 1=0 +0 
= n( 1 — =) (4.584) 
-p0 —__} 4.585 
© Ti ia (4.585) 


The series for p(x) is recognized as a logarithm, which gives the final result 


1 


LR (4.586) 


/ l —ikr(x—x 
POPO) = ye 


Like (ex) | 
= eie O O 4.587 
m a — i(x — x’) l ) 
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The limit x —> 0 does indeed recover the noninteracting value (4.571). The parameter a 
becomes the convergence factor n. 

It is easy to check that the factor (PE (x) P(x! )) is also given correctly. The repre- 
sentation (4.575) for VY ,,(x) reproduces the commutation relation with the density operators 
and also gives the correct ground state momentum distribution for the noninteracting system. 
All these results, of course, apply in the limit where « — 0. The commutator of ‘P,,(x) with 
H, is also given correctly, since the latter is expressed in terms of the density operators, which 
have the correct commutators. The representation (4.575) is suitable for the single-fermion 
operators. 

This representation can be used to calculate many interesting properties of the Luttinger 
model. For example, the electron Green’s function is 


G(x — x’, t) = —i(T¥ (x, HT x’, 0)) 
= —10() (eH, (xje OH (x) 
+ 10(t)(P1 (x eo, (xe ") (4.588) 


The correlation functions can be evaluated at zero temperature, using the same steps which 
led to (4.581). The time dependence of ‘¥,,(x, t) for the noninteracting Hamiltonian is 


Yim, t) = 


l 
explikpx + J1,(a, x, t 4.589 


3 2T, ; 

— k _ 7 E 

ist % 0 = k 4 K me Dik — ee OD! 1) 
> 


Jis(&, X, t) = Jıs(&, x — Vpt) 


(4.590) 


The time dependence of ‘V,,(x, t) = Y,(x — vpt) merely changes the factor x in J;,(a, x) to 
z — „Vrt This rather trivial change makes it possible to use the previous result for 
Fi OP) )) to evaluate ( (PT (x! YP (x, D): 


PLOP D) = 5—— explike =x) — Qo — x + vrt) 


eikr(x—x’) 
E omic — x’ — vrt — ia) (4.591) 
(P15, NY! œ) = z explikp(x — x) — pox — x + vrt)] 
ike (x—x’) 
(4.592) 


7 2ni(x — x! — vpt + ia) 


The factor (Fx, NFI (x )) has just the Hermitian conjugate of pọ. The Green’s function for 
the noninteracting system is easily obtained (a — 0): 


ikpx — 
G,,(x, t) =< _— + ae (4.593) 


20 |X—Upt+in x—vpt — ia 
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This equation can be Fourier-transformed to obtain the Green’s function in the wave vector 
representation: 


G,,(k, t) = | dxe—""'G, (x, t) (4.594) 
= ie FF ) (Q(HO(k — kz) — O(—NO(kp — k) (4.595) 


This result is the same G(k, t) which is obtain in the fermion representation: 
Gis(k, t) = —i (Ta; x (Da z (0) (4.596) 


where the energy has been normalized to the Fermi energy: €, = vp(k — kp). The correct 
result for G;,(k, t) again illustrates that the boson representation (4.575) for the single-particle 
operators will faithfully reproduce the results obtained directly from the fermion repre- 
sentation. The virtue of the Boson representation is that more difficult problems can be 
solved. In particular, interaction terms can be added to the Hamiltonian. Exact expressions 
can be found for Green’s functions, or other correlation functions, although they are usually 
difficult to evaluate analytically. 


4.5.5. Interacting System of Spinless Fermions 


An exact solution can be obtained for various correlation functions, even for the 
interacting electron gas in one dimension. First solve for the occupation number. This solution 
relies upon the representation of the single-particle operators which was developed in the 
prior subsection. The Hamiltonian in this part is taken to be the Luttinger model for spinless 
fermions (Mattis and Lieb, 1965): 


H=2 [oC] bip +b} pbo,-p) + VE b} p + ba -pbi p) (4.597) 
p> 


The interaction term comes from particle—particle interactions between the two types of 
fermions. Other interaction terms could be considered. 

The first step in the solution is to learn the method of diagonalizing this Hamiltonian. 
There are several ways to do this, and all give the same result. A canonical transformation is 
used to obtain a new set of boson operators «,, B,, which are defined as 


bip = B, cosh(A,) — of sinh(A,) (4.598) 

bi , = B} cosh(A,) — a, sinh(A,) (4.599) 

b> _, =a, cosh(A,) — Bi sinh(A,) (4.600) 
bi _, =a) cosh(A,) — B, sinh(A,) (4.601) 
[bi p 51 „1 = [B, Bf] cosh? (An) + [a}, a] sinh?(A,) (4.602) 
= cosh’(A,) — sinh? (à,) = 1 (4.603) 


[a,, Bi] = 0, [xp of] = 1 (4.604) 
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The various commutation relations are still obeyed in this new representation. The parameter 


Àp is chosen so that the Hamiltonian (4.597) is diagonalized. It is first written out in terms of 


the transformed operators: 
H= L (BIB, + afa,){[cosh?(A,) + sinh?’ (A lo, — 27, sinh(A,) cosh(A,)} 
p> 
+ (Bhat + o,B,){[cosh’(A,) + sinh? (A, )] V, — 20, sinh(A,) cosh(A,)}] 
Since these are boson operators, the ordering of terms such as «8 = Ba does not matter. The 
zero-point motion terms are ignored. Two combinations of hyperbolic functions seem to 


occur: 


cosh? (à) + sinh” (A,) = cosh(2,,) (4.605) 
2sinh(A,,) cosh(A,) = sinh(2A,) (4.606) 


The Hamiltonian is diagonalized by setting to zero the coefficient of the term (Bho + «,B,). 
This step gives tanh(2A,) = V,/@,, so that the diagonalized Hamiltonian is 


H= 2 EBIB, + atap) (4.607) 

E, = Jœ- V2 (4.608) 
(A) 

cosh(2h,) = -2 (4.609) 
Ep 


The transformation to the new operators is used to evaluate the properties of the interacting 
system. The «, and B, operators refer to the actual boson normal modes in the interacting 
system. The ground state of the system is the vacuum of a, and B, particles; i.e., 
a,|0) = 0,8,|0) = 0. These are the same set of normal modes in the Tomonaga model 
(4.498). 

Consider the evaluation of the fermion occupation number, which is given in (4.581) as 


the ground state expectation value of the operator combination: 


(PEOP O) = z exp[—ike(x — x’) — po — x’) (4.610) 
e7% — (e77 x) g0) (4.611) 


where J, (a, x) is given in (4.577). The ground state of the system must be the particle vacuum 
of the bosons with excitation energy E, in (4.608), since these are the normal modes. The 
J\(%,x) operator must be expressed in the a, and B, representation. The transformation 
(4.600) produces a redefined operator form 


—4 2T ipx : 
J (a, x) = de p/2 [ze [B, cosh(A,) — at} sinh(A,)] 


— e TBS cosh(A,) — a, sinh(A,)]} (4.612) 
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It contains operators of both types «, and B,. These operators are independent, since they 
each describe an independent Boson system. Each of these boson systems are averaged 
independently. The ground state average gives 


Po(X) = Pax) + p(x) (4.613) 
eo Pal) — (e Ya) e/a) (4.614) 
oe oo) — (oY) plo) (4.615) 

I(x) = § e2. iB sinh(A, (ea, — e”*a!) (4.616) 

p>90 pL 

I = Yo el? B cosh(A, )(e”*B, — expt) (4.617) 

p>0 pL 


The separate averages for þ,(x) and þ,(x) are similar to those found earlier in (4.582). The 
average for ¢,(x) is identical to the earlier average for p(x), except for the extra kernel 
cosh(A,,). The average for ,(x) also contains a unique kernel sinh(A,) and has x — —x. By 
analogy with (4.582), at zero temperature 


p(x) = 1} y T cosh?(A,)(1 — e) (4.618) 


p, œ) = — Ty Na sinh?) p (1 — e™) (4.619) 


The result for p(x) is manipulated by replacing cosh? (A,,) by its equivalent 1 + sinh’(A,). 
The term with “1” is identical to oe 


,(x) = bo) + as ~ sink A, (1 — e?*) (4.620) 


so that 


Pa + Oy = Po + Os (4.621) 


(x) = = TES — ~ sinh? O L — e™) + (1 — e”™)] 


OAE si -o= 4.622 
CPi) 1) =- Ga (4.622) 
The effect of the interactions on the electron gas is contained in the exponential factor 
exp(—ọ,). The other terms in (4.622) are the same as for the noninteracting electron gas. 
By using the relation 


h(n.) = g=% 
sinh (à,) = 5 [cosh(2A,) — 1] = 5 2 | (4.623) 
_ (ap a 
0,(x) = | 5 —e? (7 - E, ii — cos(px)] (4.624) 


which uses the prior result (4.609). 
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Any evaluation of the factor o,(x) must assume some specific form of the potential V, 
between electrons. One possible model is to take V, = pVo, V) = constant. This form of the 
potential is obtained from a delta function interaction in real space. This model assumes that 


the particles interact only when they directly collide. In this case (œ, = pvF) 


E, = pJ} — V? (4.625) 


, l v 
g = sinh? (A) = 7 a — J (4.626) 
,(x) = 2g | dP 6-11 — cos(px)) (4.627) 
0 P 


The factor sinh” (A,) is a constant, which is called g. The integral for the exponential factor 
,(x) is now simple to evaluate, since it has the same form as earlier for p(x) in (4.582): 


LO =el + 6460] = gin(1 +5) (4.628) 


The delta function model makes the following prediction for the momentum distribution of 
the 1-particles: 


POF (x! “wn l 4.629 
(POROD = -aae r F + yF oe” 
CO 
mh, = | dret = PTE) 
—oo 
œ dy pitlk-ke) y2 4.630 
=-| Soe (x? + (4.630) 


Setting g = 0 recovers the noninteracting case nı = O(k; — k), which is obtained by 
closing the contour of integration in the UHP (upper half-plane) when kp < k and in the LHP 
(lower half-plane) when kp > k. The pole at x = —ia is circled only in the latter case. 

Mattis and Lieb (1965) showed that a more interesting result is found for the case where 
the coupling constant g is nonzero. Then n; , = constant, independent of k, so the Fermi 
distribution is totally destroyed. This happens even in the limit where g is infinitesimally 
small. As g — 0, then n; 4 = 3. 

This result is obtained by changing the integration variable to y = x/a: 


-œ 2ti y+i OF +1) 


© d iya(k—kr) 1 
nik = -| Aki (4.631) 


The only a dependence is in the exponential factor. This exponential factor is needed for 
g = 0, since it tells us whether to close the integration contour in the upper or lower half- 
plane. However, for a nonzero value of g the integral converges even without the exponential 
factor. Therefore set « = 0 before doing the integral and consider 


2% dy 1 1 
__ ee 4.632 
Mk i 2niy + iQ +1) (4.632) 


276 Chap. 4 e Exactly Solvable Models 


The right-hand side is no longer a function of k — kp, and is a constant. The integral for y > 0 
is added to that for y < 0 by changing the variable y —> —y in the latter to give the real 
integral: 


° dy 1 l 1 
=| < |— + - 4.633 
“hk I ie oh 57) (4633) 
Xd 1 1 Ta 
=| ° een (4.634) 
o T(1+y E 2,./nT(1 +2) 
The integral is in a standard form, which is given in tables [G&R, 3.194(3) after changing 


y? = x] in terms of gamma functions. In the limit where g > 0 then T G) = vm, Tr(1) = 1 so 
that 


i — 1 
hm Nik = 5 (4.635) 


The distribution function is a nonanalytic function of the coupling constant g. The usual 
noninteracting distribution function is found in the case of g = 0. The introduction of an 
arbitrarily weak delta function potential destroys the Fermi distribution, and each wave vector 
state is occupied with an equal probability. For the case where g — 0, this probability 
approaches L The g = 0 result is not obtained in the g — 0 limit. 

This result would be difficult to prove by perturbation theory and shows the value of an 
exact solution. These results pertain only to the one-dimensional electron gas. Behavior of 
this type is called non-Fermi liquid behavior. Fermi liquid theory is discussed in Chapter 11. 
Another name to describe such systems is Luttinger liquids. 


4.6. POLARITONS 


4.6.1. Semiclassical Discussion 


The word polariton was coined by Hopfield (1958) to describe the normal modes in 
solids which propagate as electromagnetic waves. The word is a combination of polarization 
and photon, because these modes are combinations of free photons and the polarization 
modes of the solid. A new word was needed, because a new view was then emerging about 
the optical properties of solids. Hopfield popularized this new physics, although similar ideas 
had been discussed earlier by Fano (1956, 1960) and by Born and Huang (1954). 

In the old view of electromagnetic wave propagation in solids, the light shone upon the 
surface of a sample and went into it. The polarization modes of the solid, e.g., TO phonons, 
could absorb some of this light. 

The new view is that the light and the polarization modes in the solid are coupled into a 
new set of normal modes. These new modes are called polaritons. When light is shone upon 
the surface, polaritons are created which propagate inward. The mathematics is trivial; since 
both the photons and the polarization modes are usually described by harmonic oscillator 
equations, the new modes are obtained by solving coupled harmonic oscillator equations. The 
physical effect is semiclassical and need not involve quantum mechanics. The photon Green’s 
function 2 (q, œ) in a system with dielectric function E,y(q, œ) was derived in (2.185). A 
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transverse wave will propagate with E L q, and the normal modes are the poles of the Green’s 
functions. Assuming they exist, these poles are at 


as (q, @) = o? = (qc) (4.636) 


where €,(q,@) is the transverse dielectric function. 

The normal modes are the values of @ which satisfy this equation; call them Q,. For 
example, consider the phenomenological equation from optical phonons in ionic crystals for 
the transverse dielectric function, which is the same as the longitudinal dielectric function at 
long wavelength: 

— € 


20 (4.637) 


€ = aaas ausu 
(@) = Eoo + 1— a, 


When this form of the dielectric function is used in (4.636), there results a quadratic equation 
for œ. It may be solved in order to obtain the following two solutions: 


= 3 (0; + fo) +5 (a; + 020) — 4620 2 2 J2 
_ (4) (4.638) 


Eoo 


8 = 
Og = 


These solutions are plotted in Fig. 4.16. The solid lines are the solutions to (4.638) and are the 
actual normal modes in the solid. The dashed lines show the unperturbed photon mode Og 
and phonon mode 7 . It is these two which are coupled to form the new set of normal 
modes. 

The picture of energy propagation is illustrated in Fig. 4.17. It shows a slab of polar 
material upon which light of frequency œ is incident J from the left. Some of the light is 
reflected R, while other parts may be transmitted through the slab and out the other side T. 
However, inside of the slab, the normal modes which propagate are those given by Eq. 
(4.638). The wave vectors of the polariton modes are determined by œ = Q, and not by the 
free-photon value q = (@/c),/é,,. This point is further illustrated by the observation that 
there are no polariton modes of real frequency in the range ®;9 > © > Oro, as shown in Fig. 
4.16. Since there are no polariton modes in this gap, no energy can be transmitted through 
the slab (actually modes in the range ®;9 > © > @zo have complex wave vector gp + ig; = 
(@/c),/e(@) so that energy can be transmitted through the slab of thickness d with the 
probability exp(—2dq,), which is neglected). All the energy is reflected from the slab, as 


LO Reflectance 


FIGURE 4.16 Polariton energy bands are shown at the left. There are no bands allowed for ®79 < @ < @ 0, so that 
the solid must have a reflectivity of unity in this interval. 
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polariton 
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FIGURE 4.17 When light has normal incidence / upon a slab, there are the usual reflected R and transmitted 
components 7. Inside the slab, the normal modes are polaritons. 


shown in Fig. 4.16. The complete reflectivity for @;9 > © > @7g was known earlier for a 
simple oscillator in a solid, but the physics is clarified when cast into the form of an energy 
gap in the spectrum of normal modes. 

The above equations have no damping in them, so that light is not absorbed in this 
model. All the electromagnetic energy, incident upon the sample, is either reflected or 
transmitted. The transmitted parts get carried through the slab by the polariton modes. The 
absorption of electromagnetic energy can be introduced into the equations by introducing 
damping for the phonons or additional scattering for the photons. However, if the TO-phonon 
system has no damping, then it does not cause energy absorption. The polariton picture is a 
rather different physical model than the older one in which the phonons caused the absorption 
of energy in the solid, and this absorption in turn led to reflection. 

The most direct experimental verification of the theory is to measure the polariton 
dispersion curves, which has been done for the polaritons associated both with optical 
phonons and excitons. The first measurement was by Henry and Hopfield (1965), who 
measured the polaritons associated with the optical phonons in GaP. Their results are shown 
in Fig. 4.18, where the solid line is the theoretical curve (4.638), which compares well with 
their experimental points. They measured these modes by Raman scattering. The frequency of 
the Raman shift gave the frequency of the excited mode, while the directional change during 
the Raman scattering gives the information needed to deduce the wave vector of the polariton. 
There is obviously excellent agreement between theory and experiment. 


4.6.2. Phonon-Photon Coupling 


The semiclassical theory will be calculated using Green’s functions. The self-energy 
operators for the photon, at least the parts due to the phonons, may be evaluated exactly, 
which gives the semiclassical theory. An interesting aspect to this derivation is that there are 
contributions from both the first- and second-order self-energy. Fortunately that is all, and the 
exact self-energy is found after the first two orders of perturbation theory. 

The Hamiltonian has the form 


H=H)+H' (4.639) 


Hy = Oro X biby + 3 Dga lq (4.640) 
q q 


, e 
H =— > P,-A(R,) + sya AR) -A(R,) (4.641) 


Sec. 4.6 e Polaritons 279 


Ok, IN If DIRECTION 
© kz IN 100 DIRECTION 
O POLYCRYSTALLINE SAMPLE 


hw (eV) 


heq (eV) 


FIGURE 4.18 A plot of the observed energies and wave vectors of the polaritons and of the LO phonons in GaP. The 
theoretical dispersion curves are shown by the solid lines. The dispersion curves for the uncoupled dispersion curve 
are shown by the dashed lines. Source: Henry and Hopfield (1965) (used with permission). 


The operators Bg: b, are for phonons and dgr» al, for photons. The interaction between the 
photons and optical phonons is contained in the term H’. The ions in the solid are treated as 
spherical balls with charge +e, equilibrium position R;, and momentum P;. The reduced mass 
M is used for the relative motions of positive and negative ions in the optical mode of 
oscillation. In terms of these coordinates, the current J; and vector potential A(R;) are 


expressed (see Secs. 1.1 and 2.10) in terms of phonon and photon operators: 


Cw \22 a qr) “ig 
=P = le D(a) é (bt — ba)" = gL aR, 


(4.642) 
1/2, 
ID = ie() EO} - bg) 
A, (Rj) = (me) : eV Bay tatg) == DA, (ger® 
prey a \ @qV u qÀ -a NN m n (4.643) 


1/2 
A,,(q) = 2 (2: MACAN + a'a) 


q 


The first self-energy is from the photon self-energy, which arises from the term in A*. The 
interaction in the second term in (4.641) is 


2 
e 
Hp =—— )_A,(qg)A, (— . 
A? Mn „(q) u( q) (4 644) 
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where Vy) = V/N is the volume of the unit cell in a binary crystal of positive and negative ions; 
1.€., Vo is the volume per phonon mode. The self-energy comes from the first-order term in the 
expansion of the S matrix: 


BO 
2 (k, io) = — | dte™ (T.A,(k, t)Ay(—k, 0)) 


e P mf 

Vo | É | iE (4.645) 
x (TA, (k, 1)45(q, T1)4s(-4, t))Ay(—k, 0)) + --- 

A, (k, t) = e” A e 


A, (k, t) = eA eo 


= Bk, im) + 


The four-operator sequence in the interaction term is equal to 


P fÊ n n a n 
l de" | dt, } (TA), (K, 1)45(q, T1)4s(—4q, T11)4,(—k, 0)) 
° 0 aè (4.646) 


= 2° DYKK, ia) DS) (k, io) 
ô 


The factor of 2 arises because of the two choices of pairing: k = q or k = —q. The series in 
(4.645) is 
P (k, iœ) = IO(k, io) + 0” 3 PN(k, i@) DS) (k, i) ++- 
(4.647) 
oO = eo 
Mvo 


According to (2.184) the self-energy TI (k, iœ) enters into the denominator of the photon 
Green’s function with an additional factor of 47: 


An(Ony — kik, /k*) 


D (k, io) = —— w wt 4.648 
uv ) (im)” — w2 — 4n” l ) 
4 
4ni ® = tre = o? (4.649) 
0 


The quantity 4x11 = o? is just a plasma frequency. It would be the longitudinal frequency 
©; = @, of the ion vibrational oscillation if there were no other restoring force. Of course, 
there is a restoring force, which leads to the transverse frequency ®79 = Wy. The longitudinal 
frequency is then determined by the combination of the short-range restoring force 7 and 
the long-range Coulomb force 07, @}o = Of + 0. All this discussion is slightly irrelevant 
because we are solving for transverse modes, not longitudinal modes. For the transverse case, 
the modes are at the poles of the Green’s function in (4.648). The poles are at the frequencies 
o? = wf + 41, ¢(k, ©), which would be at c?k? + w; if there were no other contributions to 
the self-energy. However, there is another contribution to II(k, œ) from the P- A term in H’, 
which cancels 0%. 

The 4? term in the interaction H’ does not contribute any further terms to the self-energy 
of the photon. The self-energy contribution II™ provides an exact evaluation of this term in 
the Hamiltonian. This exact result could have been deduced without the aid of Green’s 
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functions. The Hamiltonian for free photons can be written in terms of harmonic oscillator 
coordinates by identifying the displacement q,,(k) = 4„(k)/v 4r. The Hamiltonian has the 
form 


Ho, proton = 5 ZIP) * p(k) + œa) - a(—K)] (4.650) 


When the 4? term is written in the same notation, it is 2q(k) + q(—k)/2. Add the A? term to 
Ho, and then the new potential energy is Q? q(k) - q(—k) /2, where the new modes are 
OF = o? + œ. The same result was obtained using Green’s functions. 

The next step is to evaluate the self-energy arising from the J- A term in the interaction 
H’ in (4.641). The symbol J; denotes the current of the vibrating ion pair. The J+ A term is 


rewritten in wave vector space as 


H,.4 =- EW: A(—k) (4.651) 


One factor in the self-energy expressions for the photon is the correlation function of the ion 
momentum J„(k) with itself. Define a correlation function 


B 
®,,,(k, iœ) = -| dre (T [ew J, (ke U,(—k)) 
0 


e Wo IOT TO, / LÝ en t 
= ðw =F JY, of dte™ le? (bby) + °(b, by) ] 
e og 


=8, L —_0 4.652 
i M (io) — (OG 


which is easily evaluated since the properties of the phonon operators are known. 

The J+A interaction does not contribute any term to the first-order self-energy 
expression IT), because the first-order term has only a single factor of the operator J,,(k), and 
the average of this operator is zero. The second-order term in the expansion for the S matrix is 


l B B ` . 
Ej dte” | ar, | dt X, 9} (T J3 (ky, 71 )J5 (Ky, T2)) 
2:Vo 0 0 kik; 88’ 


x (T.Ay(K, 1)49(ky, 1 )Ay (k2, 12)4,(—k, 0)) 


which is evaluated by the usual rules to give 
Elk 10) Os5:(k, 10) D(k, iw) (4.653) 


where ®55/(k, iœ) is given in (4.652). A factor of 2 comes from the two possible pairings 
k = k; or k = k,. This self-energy expression is 
FOG 


WW k, ia) = k, io ———__—_—_ 
(K 10) = | Ov (k, i) = FE ah 


(4.654) 
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This self-energy term is the only one from this interaction. The total self-energy is obtained by 
adding the two contributions: 


2 O 2 2 
4nTI(k, w) = ar +11] = 03] 4 r | T 


(in)? — o2 p (iw)? — o2 
The photon self-energy for the Hamiltonian (4.641) has been found exactly. The energy 
denominator in the photon Green’s function (4.648) can now be written in terms of the 
dielectric function: 

(iw) — œ? — 4nII(k, im) = (iw)*e(iw) — o? 


o2 


p 
«2 — (iw)? (4.655) 
AT(Ouy — kuky /k?) 


(iw) elio) — o? 


e(i@) = 1+ 


2 (k, iœ) = 


This dielectric function should be compared with (4.637) from optical phonons. The present 
discussion assumes that ¢,, = 1. Otherwise, a value of £ other than unity will modify the 
vector potential with factors of €. Similarly, identify the static dielectric constant as 
E& = 1+ o? /œĝ. Another feature of the dielectric function (4.655) is that the frequency of the 
longitudinal mode oscillation is determined by the condition that ¢(@;9) = 0, which has the 
solution ®7 = 05 + @). The transverse frequency is 79 = Wp. With these identifications, 
the two expressions (4.637) and (4.655) for the dielectric functions are identical. 

The Green’s function solution of the Hamiltonian for the coupled phonon—photon modes 
yields exactly the same equation w7e(w) = wz as found from the semiclassical arguments in 
Sec. 4.6.1. The two modes, photon and phonon, are coupled and form two new modes which 
are polaritons. In fact, the original Hamiltonian (4.641) could be solved by a canonical 
transformation to a new set of operators, say a, and B,, which diagonalize the Hamiltonian. 
These new creation and destruction operators are for the polariton modes. This exercise is 
assigned as a problem. 


4.6.3. Exciton-—Photon Coupling 


The polariton concept applies to the mixing of any polarization waves with the photons. 
Although the phonons are an obvious example, the physics is the same for waves which are 
associated with electronic polarization. Here we analyze a simple case of this phenomena. The 
solid is treated as a collection of atoms which are only weakly interacting, say by van der 
Waals forces. The electrons in each atom can occupy a set of discrete energy levels ¢,,. This 
model is applied to molecular solids, in which molecules are weakly bound together, and the 
optical properties of the solid are often only weak perturbations on the optical properties of 
the free molecules (see Davydov, 1971, or Mahan, 1975). 

A light wave, which is traveling through the solid, can induce the electrons to change 
their electronic state from n to m. This excitation process can really happen if the photon 
energy Aw matches the energy of electronic excitation iw = &,, — €,. However, for photon 
frequencies œ which do not match any of the excitation frequencies © 4 , = Ep — &,, real 
excitations cannot occur. Then the light wave will induce a virtual transition to this excited 
state: a virtual process is an excitation which starts to occur but violates energy conservation, 
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so the excitation process ceases. Nevertheless, the electronic system is polarized during this 
virtual excitation process. The atom will have electric dipole moment u whose magnitude is 
proportional to the amplitude of the electric field of the light which is acting upon the atom: 


Hy = 2 ay (0)E e (4.656) 


An expression for the atomic polarizability «,,(@) will be derived below. The Hamil- 
tonian for each atom of atomic number Z will be taken to have the nonrelativistic form 


H = Pe — Ze D> +% “yl (4.657) 
ri 2y 


It is assumed to have solutions which are single-particle orbitals ġ„(r) with energy ¢,,. In the 
Hartree-Fock approximation, the state of the atom is a Slater determinant of the occupied 
orbital states. The ground state is the Slater determinant of the set of orbitals with the lowest 
energy. An excited electronic state is obtained by raising an electron from one of the ground 
state orbitals to another one of higher energy, which is unoccupied in the ground state. Denote 
by Greek subscripts « or B the possible excitation energies @, = €,, — €„. Each excitation « 
corresponds to moving one electron from a ground state to an excited state orbital. The 
operator b} is introduced to describe this excitation, which can be defined in terms of the 
operators a,, of the electronic states: 


bt = ala, (4.658) 


The a, operators are fermions. The b, operators are the product of two fermion operators and 
have boson properties. However, they are not bosons since the number in each state is limited: 
one has that (bi) = 0, where the value N depends on the maximum number of electrons 
which are in state n or may be in state m. However, the b, operators are approximated as pure 
bosons. It is an adequate approximation as long as the density of such bosons is low, so that 
the average number of bosons on each atomic site is much less than unity. Then the restriction 
on the site occupancy of the bosons is irrelevant, and the b, operators obey good boson 
statistics. The Hamiltonian of a single atom is approximated as 


H =E, + hiw,bib, (4.659) 


[ba bb] = dup (4.660) 


where E, is the ground state energy and bib, is the number of excitations. 

The model of an atom is far too simple by today’s theoretical standards. The Hartree— 
Fock (HF) approximation is seldom adequate to describe an atom, and usually one has to take 
a linear combination of HF states (configurational interaction, or CI in the chemist’s notation). 
The single-particle orbitals o,(r) also vary with the excited state configuration, which 
requires a more complicated Hamiltonian than (4.659) in order to describe excitations. These 
complications are omitted, which hopefully are only a side issue for the discussion of exci- 
tons. 

An important quantity for our discussion is the dipole matrix element X(«) for the states 
, and @,, which comprise the transition 


X = | droneo) (4.661) 
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This matrix element will be zero in most cases, since it is nonzero only when the two 
electronic orbitals 0, and ©,, differ by one unit of angular momentum. This matrix element is 
important for the case the excitations occur by the interaction with light. The rate of excitation 
is determined by the dipole matrix element. 

Now introduce an operator P, which is the polarization operator for the atom, where u 
denotes vector direction and « denotes excitation: 


P, =e>-X,(a)(b, + b}) (4.662) 
H 


It should not be confused with the momentum, which has the same symbol. This operator has 
the units of polarization, which is charge times length. One can think of the quantity 
Xba + b}) as a typical displacement operator, which in this case refers to the displacement 
of electronic charge in the atom. It is quite analogous to the displacement operator for the 
harmonic oscillator system, which is X,(ag + ala), where X, = (f /2M Oa)” * is the unit of 
length. 

The symbol Z (iœ) is used to denote the correlation function of the polarization 
operator P, with itself: 


, 
F (iO) = -| dre™™ (T.P (1)P,(0)) (4.663) 
0 
It is easily evaluated for the noninteracting Hamiltonian (4.659): 


BO 
Fin) = -e | dre! Y X (a)X,(a(e* (by, bl) + e°* (bLb,)) (4.664) 


2e°X_ XO, 
Fwlio) = 


(4.665 
æ (io) — o2 ( ) 


The quantity F Go) is actually the polarizability œ „„(îœ@) of the atom. To keep this 
discussion simple, assume that this quantity is isotropic: 


F iw) = a(i) = ô valo) (4.666) 


X*o 
alio) = 2e* Y 5 Hoa 


M 4.667 
a Wy — (io) ( ) 


So far the discussion has been confined to the properties of a single atom. Next consider the 
additional properties which result when the atoms are collected together in the solid. The 
discussion about photons is deferred to later, and now consider only the original Hamiltonian 
(4.657) as applied to a collection of atoms. The electrons on one atom will interact with the 


nucleus and electrons on adjacent atoms. Denote the position of an electron r; = R; + x,, 
where R; is the position of the atom and x; is the displacement of the electron from the center 
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of the atom. For two electrons on different atoms, the interaction is expanded assuming that 


tL 1 (x; — x): (R, - R;) 
I; = r| IR; —R,| IR; — Rl’ 
+ FD; — yp wR — RG — &)y $0 (4.668) 
ò 3R R 
__ “w u4*v 
Puy (R) — R — R5 


The first two terms can usually be neglected, since the first is a constant, and the second will 
vanish for crystals with inversion symmetry. The first important term, resulting from the 
interaction between electrons on different atoms, is the dipole-dipole interaction: 


Xin Pyy(R; _ R,)Xjy (4.669) 


The standard practice is to retain only this term in the theory, which is called the dipole 
approximation. Sometimes it is improved by also including the next terms, which involve 
quadrupoles and octopoles. For a treatment of exciton theory without making a monopole 
expansion but instead retaining all the terms in the electron—electron interaction, see Agra- 
novitch (1960, 1961) for the formalism and Mahan (1975) for a sample calculation. 

The notation needs to be expanded to include the site / of the atom in the solid. The 
exciton operators are referred to as b, and polarization operators as P,,(/). For one atom per 
unit cell, each quantity may also be expanded in wave vector space: 


PD = eX, (a)(b,,, + bt) (4.670) 
P (k) = 7 3 el RiP (1) =e Y X, (0)bax + Oh) (4.671) 
bak = 7 Ye Rp (4.672) 

l 


In the dipole-dipole interaction (4.669), the dipole moment of the electron ex;, is replaced by 


the equivalent displacement operator P,,(i). The dipole-dipole interaction between excitations 
on different atoms becomes: 


e2 l 
T Ee u DlR; = Rv > 7 X P ODR; = RIP) 


ijuv 
-TT P TPK) (4.673) 

0 pvk 
Tull) = Few OR) (4.674) 


The interaction between excitations on the same atom is omitted, since this contribution is 
already included implicitly in the free-atom Hamiltonian (4.657). The last step above gives 
the interaction in wave vector notation. The excitations in wave vector space interact with the 
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lattice transform of the dipole-dipole interaction, which is called 7,,,(k). This function is only 
needed at small wave vectors, which for cubic crystals is (Cohen and Keffer, 1955) 


mn — 45, + Olka) (4.675) 


lim Tyy(k) = 
where a is a lattice constant. If one tries to evaluate this function exactly at the point k = 0, it 
amounts to summing the dipole-dipole interaction over neighboring sites, which gives zero. 
The function is conditionally convergent, since the limit as k goes to zero gives a different 
result than at the point that k = 0. The limiting value is correct. 

The Hamiltonian under discussion is the original atomic part (4.659) plus the dipole— 
dipole interaction term. The atomic part can also be expressed in wave vector notation, so 
consider 


1 
H = X Ob, bak + 5 2 VupKN buk + bf Kp,- + 5 x) 


4.676 
Atte” l ) 


Vp(K) = n A (aT (k)X, (P) 


This form of a Hamiltonian has been diagonalized before, see (4.438). However, the present 
case is more complicated, because of the summation over the different excitations « and B. 
Equation (4.676) is solved by the use of a Green’s function. The correlation function (4.663) 
is redefined to include wave vectors: 


B 
F (k, i@) = — | dte®™(T,P (k, t)P,,(—k, 0)) (4.677) 
0 


This correlation function is evaluated by a diagrammatic expansion. The first term in the 
Hamiltonian (4.676) is treated as Hy, and the second term is the interaction V. For this Ho the 
unperturbed Green’s function was evaluated earlier in (4.664) and is the negative of the atomic 
polarizability: 
2e7X Xo . 
Fk, io) = = tlio) (4.678) 
x (i@,) — Oy 

The self-energy term for the perturbation V occurs in first order. An important feature of the 
evaluation of this term is the me of the potential Vyp, 


V=— ar P,(k)T,y(K)P,(—k) (4.679) 
Vo vk 


so that the different « and B summations just go into the factors P,(k) and P,(—k). For 
example, the first-order term in the expansion of the S matrix is 


F (k, iœ) = zo f dre f dt, >. = yk) 


p’v’k’ “O0 
x (TP, Ak, PP, (—K', t,)Ê,(k', t,)P,(—k, 0)) + 


= F%(k, io) +a LF FAK, 10) Tyy(K)F yolk, 10) + + + 
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The self-energy is 4n7,,,/Vo. It is the only self-energy term, and higher-order terms in the S- 
matrix expansion just produce multiples of this term. The Dyson equation for the correlation 
function is the matrix equation 


. 4r , 
F „h(k, iœ) = FO (k, io) + v 3 FV) (k, i@)Tyy(k)F ylk, io) (4.680) 


This equation is not too hard to solve in the general case, since it is only a 3 x 3 matrix in the 
direction variables (u, v) = (x, y, z). We shall only consider the simplest possible case, which 
has an isotropic polarizability F ”) = —6,,a and the long-wavelength form of the interaction 
Ty as given in (4.675). Then the equation to be solved simplifies to 


Ana fkih, Oy 
F yy = —Ò phia- aa, (4.681) 
0 à 


At small values of wave vector the matrix F „y depends only on the scalar functions C and D, 
kk 
+ D#7 (4.682) 


We can generate equations in only C and D by inserting the above results in (4.681) and then 
equating coefficients of the terms 6,,, and k ky /k? 


Ana 
C = — — C 4.683 
a + 3v, ( ) 
Ara > 
0 


These two equations for C and D can be solved algebraically: 


o4 

C=- — 

1 — 4ra/3vo 

a(4ra/vo) 
D= ————— ~> WV 
(1 — 4ra /3v)(1 + 8na/3vo) (4.685) 
F = ee 5 — 4na kyky 

Wy 1—4ra/3vo| "  1+8ra/3v k? 


Equation (4.685) is the long-wavelength form of F (k, iœ). 

The quantity — F „y is the polarizability of the solid, just as —F A is the polarizability of 
the atom. The interactions in V modify the atomic polarizability. These modifications are just 
the local field corrections. In a cubic dielectric, the dielectric function at long wavelength has 
the Lorenz—Lorentz form: 


ATA Vo 


SO 4.686 
1 — 4na/3vo (4.686) 


e(@) = 1+ 
The factor 4ma/3v,) in the denominator is a local field correction. The same factors are 
derived in the first term of F „y in (4.685). The atomic polarizability « has dimensions of mî, 
so the polarizability per unit volume &/vọ is dimensionless. Often this combination itself is 


written as a. In any case, the dielectric function (4.686) may be derived from the correlation 
function F „y in (4.685). 
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Now solve the Hamiltonian for the interaction between excitons and photons: 


1 e 
H = Y Oya} ayy + = > |p, —ÊA(r, |+ are (4.687) 
Àk mM i C 


= 


This Hamiltonian can be rewritten in terms of the operators for excitons and photons. The 
electron—electron interaction is written in terms of the dipole-dipole approximation, as 
discussed above 


27 
H = = Okala Aer + 3 Oabl Dak + Vs 2 P,(kK)T,,,(K)P\(k) 


J (k) = m e el 7 bak) 


(4.688) 


pù) = -i | dre, o- in (r) 


where the operator J,,(k) is the current operator for the exciton system. The two operators J, 
and P, are simply related. The classical expression has the current as the time derivative of 
the polarization: 


ð 
J= ae (4.689) 
J.,(k, 1) sin ð P,,(k, 1) (4.690) 


This expression is also correct for the operators, which can be derived from the relation 
p = —im|r, H] between momentum and position. 

The photon Green’s function is now obtained by the same procedure as used in Sec. 
4.6.2. The two self-energy terms are evaluated as before. The first-order self-energy, which 
comes from the 4? term, is evaluated with the same result [see (4.647)]: 

eZ 


I) = 8 


4.691 
w= (4.691) 


uv 
The factor of Z is the number of electrons in each atom in unit volume vp. It enters into the A? 
term because the Hamiltonian in (4.687) contains the summation over i, which is the 
summation over all electrons, i.e., the summation over each atom and then the summation 
over each electron on each atom. The plasma frequency in this system is then 
4nII") = 4neZ/(mvo) = o. 

The other self-energy contribution comes from the J+A term in the Hamiltonian. In 
analogy with (4.652), this term in the self-energy arises from the correlation function: 


MI) = (k, io) = if de (T J (k, t)J,(—k, 0)) (4.692) 
Vo 


This correlation function needs to be evaluated. It will be found by relating it to the corre- 
lation function for polarizability F „y which is defined in (4.677) and solved in (4.685). This 
solution is the appropriate one for our case, since F „y has been evaluated with the full 
exciton Hamiltonian, which includes all terms in the exciton—photon Hamiltonian (4.688) 
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except the J+A and A? terms. The exact solution is found for the photon self-energy, for the 
Hamiltonian (4.688), if we can relate ©,, to F. They are related by using the operator 
identity (4.690), 


and then integrating by parts on the t variable. The first integration by parts brings 


© (k, io) = — if are (T, À PrE, JES | P\(—k, “| 


t’/=0 


| 
Pplk, i0) = —F-([P Ck, B) ~ Py Ck, 0) (=) | 


-2f a e" {TP (k, ES -P,(— K.)| 
0 


Vo 


| 


7’—0 


The argument of the correlation function is rewritten as 
0 
-(T,P,(k, 0) P(-k, =) (4.693) 


and another integration by parts brings us to the expression 


D, (k, i) = moll p-ro) ) 
t=0 


— o (Puls 0)[P\(—k, —B) — P,(—k, 0))) 


to worl dre (T,P (k, t)P\(—k, 0)) 


The last term has the desired form of (iw,)’F uv: The first two terms, which are constants of 
integration, must be evaluated next. The second term is easy, since it is zero: 


(P (k, 0)[P,\(—k, —B) — P,(—k, 0)]) = Tr{e P” [P, (k)e >" P,(—k)e’” — P,(kK)P,(—k)]} 
= —Tr{e>"[P (k), P,(—k)]} = 0 (4.694) 


where the cyclic properties of the trace are used to rearrange the first term. The first term in 
@,,, requires more care, since it is nonzero and makes an important contribution to the result: 


1 OP, 
R=— z (P,& P) — P, (k, 0)] (a 


] | (2) (2 i 
Vo at! 1/=0 H H at’ t’=0 Vo Woy 


The commutators of J and P can be evaluated from their definitions: 


R = — — > X (op (B)[bak + bs Diy — 5p, 4] 


— 2 xan (4.695) 
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To evaluate the remaining summation over «, rewrite the momentum matrix element as 
ip,/m = |x, H], so that this term is 


1 
m p(x) = iœ X, (œ) (4.696) 
2e? e? 
R = —— » OX, (X (a) — Suy » Ja (4.697) 
Vo a MVo a 
20,X7(a)m 
= aa om (4.698) 
The summation over oscillator strength f, can be trivially evaluated by using the f-sum rule, 
2 hZ (4.699) 


where Z is the number of electrons on each atom. These steps show that the first term in 
(4.694) is just 


eZ 
r-—£45 4.700 
a (4.700) 
12-0, =- 4 (GOF (4.701) 
py ~ pv ag W uv . 


The two self-energy expressions may be added, 
4n(T1 + 1) = 4n(io)’ F (k, io) (4.702) 


The plasma frequency cancels from the result. The photon self-energy function is just 
proportional to the correlation function F „ of the polarization operator. Earlier it was called 
the polarizability of the solid. It is given in (4.685). Next, solve the Dyson equation for the 
photon Green’s function: 


Dy =D) + Z D Tu Day (4.703) 


By using the fact that 2, and DY) both have the factor (ò y — k,4,/ k?), it is easy to show 
that the term D(k ky/ k?) in F „v Makes no contribution. The solution to (4.703) is 


An (8 yy — kky /k?) 
NY (iw) e(i@) — ck? 


where the dielectric function is given in (4.686). The photon Green’s function is shown to be 
governed by the transverse dielectric function e(i@), which has the Lorenz—Lorentz local field 
correction. The propagating normal modes are governed by the relationship @7s(@) = ;, 
which gives the polariton modes. The quantity J, is the Green’s function for the polariton, 
since it has the correct dispersion relation. This interpretation is proper, since the function has 
the true eigenfrequencies of the exciton—photon system. 

Since both the exciton and photon system are bosons, the problem could have as easily 
been solved from the exciton viewpoint. Then the photons merely give rise to self-energy 
corrections for the exciton system. When calculating the self-energy of the exciton, with all 
photon effects included, the same dispersion relation is derived for polaritons. Since the 
photon and exciton systems have equal footing, the problem can be treated successfully from 
either starting point. The solution from the exciton system is assigned as a problem. 


(4.704) 
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The only real difference between the phonon and exciton solutions, in Sec. 4.6.2 and this 
section, is the inclusion of the local field corrections for the excitons. They arise from the term 
in the Hamiltonian describing the dipole—dipole interaction between polarization modes on 
different atoms. Of course, the phonon system also has this term in the Hamiltonian, which 
should be included in the analysis. 


Problems 


1. For the hard sphere potential, show that the diagonal reaction matrix is 


h? jy(ka) 


Rik, k) = 4.705 
(k= TET Gy (4.705) 
Next show that the exact off-diagonal reaction matrix is 
h? j,(k'a) 
Rk’, k) === 4.706 
(kK) = a (4.706) 


(Hint: Let the hard sphere be a finite step with height Vj, and solve for V) — ov.) What is the T matrix 
for the hard sphere? 


2. Consider a one-dimensional harmonic chain of infinite length. Let one atom have a mass m different 
from the others of mass M. Solve exactly for the vibrational normal modes, and obtain the condition for 
the formation of a local mode of vibration. Does this happen when the impurity mass m is lighter or 
heavier than M? 


3. Consider the vibrational modes of a three-dimensional solid. Let all atoms be alike except for one. 
Obtain the equation which determines the frequency of the normal mode. (a) Isotope effect: let the one 
atom have a different mass but the same force constants. (b) Impurity: let both mass and force constants 
be different. 


4. For the two-dimensional electron gas where the particles have kinetic energy £, = k? /2m, derive: 


e The formula for the cross section in terms of the phase shifts. (Question: what are the dimensions 
of cross section in 2D?) 

e The formula for the on-shell 7-matrix in terms of phase shifts. 

e The Friedel oscillations dn(r) in terms of phase shifts. 

e The change in the ground state energy due to the impurity in terms of phase shifts. 


5. Let two particles move with the same constant velocity v in a phonon field. The Hamiltonian is 


M , , 
H =v: (pi +P) + onata + Alet + ga-ro) 
2 V 


(a) Show that the interaction energy due to phonon exchange is (r = R} — R,) 


M? . 
Vir, v) = -2 y 24 08d) (4.707) 
q V ®y—-q'v 
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(b) For a Debye model with piezoelectric coupling, this expression may be approximated as 


(00) dQ iq:r 
aq a (4.708) 


Vir, v) = —M, x 
(rv) TE- 4m c,-—q°v 


Evaluate this integral as a function of r and v. 


6. Let the following Hamiltonian describe the phonon interaction with a collection of constant velocity 
particles: H = Hy) + V, where 


Hy = Dv," Pj + È @qaq%q (4.709) 
J q 
1 , 
V = Fe May Lery (4.710) 
q J 


where r, and p, are the position and momentum of the particles. Evaluate 
U(t) = (ete) (4.711) 


where the average is over the phonon thermodynamics. 


7. The coherent neutron scattering from the atoms in a solid is described by the correlation function 


1 , , 
Sq.) = g DEO eR) (4.712) 
J 


For a crystal, write R,(¢) = R” + Q;(t¢), where Q,(t) is the displacement due to phonons. Find the exact 
result for this correlation function. 


8. Derive a result for the second moment 
f = (a°) — (0) (4.713) 


for the spectral function (4.283). See (4.269) for definitions of (w”). 


9. Show that the free energy of the Fano—Anderson model can be written exactly as F = Fy + OF 
where 


BFy = 2 > Infl + cP] + nfl + e Pe] (4.714) 
k 
X(ip) 
SF =| dry ———__— 4.715 
pe, E0) (4.715) 


Evaluate this expression at zero temperature, and show that it agrees with Fumi’s theorem. 


10. Solve for the density of state of the Bethe lattice with z = 2: Show that it is exactly equal to the 
results for a simple one-dimensional chain of sites. 


11. Integrate the density of state p(Q) for the Bethe lattice and show that the total number of states in 
the band is one per site, as expected. 
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12. Use perturbation theory to evaluate the Green’s function for the Bethe lattice 


BO o 
4(1, ip,) = — | dre" (T C, 1 @)C1 (0) (4.716) 
p2m+l 
= = (ip, + pe"? foamy) (4.717) 


where j + 1 is a near neighbor of j. Show that the Green’s function has the expansion indicated in the 
second line. Find the first three functions f, f3, fs, where z is the number of neighbors. Can these be 
summed into a Dyson’s type of equation? 


13. Solve the Ising model for a one-dimensional chain. Start at one end of the chain, and start 
averaging over spins, and derive a self-consistent equation for the partition function. 


14. Solve the Ising model for the Bethe lattice for the case of antiferro-magnetic interactions (J < 0). 
For h = 0 prove that it has the same transition temperature as the case of ferromagnetic ordering. 


15. Solve the Tomonaga model when the interaction 
1 
U= 2 U,[p(k)p(—k) + o(k)o(—A)] (4.718) 


is added to the original Hamiltonian (4.465). Diagonalize this new Hamiltonian, and use it to evaluate 
the Pauli spin susceptibility (4.523). 


16. Consider the static density-density correlation function 


L 
AE) =7| APEE =E oske) 


(a) Evaluate this for the noninteracting electron system at T = 0. 
(b) Use the Tomonaga model to evaluate this for the interacting electron system. 


17. In the spinless Luttinger model, compare the commutator [‘Y (x), Hy] as obtained in the fermion 
representation and the boson representation. 


18. Calculate the retarded density—density correlation function for the spinless Luttinger model: 


Bo 
u(p. ia) = | dre" tT,plp, DP(-p, 0) (4.719) 
0 
p(p) = PiP) + PoP) (4.720) 
19. Interesting effects happen to the susceptibilities x(q, œ) when q ~ 2kp. This wave vector region 
may be evaluated in the Luttinger model by taking operators which are the product of particle-1 and -2 


fermions. For example, evaluate the correlation function x(q, œ) in the noninteracting spinless Luttinger 
model, where 


A(q) = | ret 09 ¥, 09 (4.721) 


x(q, t) = (Alq, DA'(G, 0)) (4.722) 
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20. Find the eigenvalue equation of the boson modes of a one-dimensional electron gas described by 
the spinless Luttinger model, which interacts with phonons. The ds are phonon operators: 


M 
H = vF 2 K(a} a1, — a} ,azs) + 2 Odid; + 3 Fe + d’ Dipa) + P2(9)] (4.723) 
all q 


21. Write the Hamiltonian (4.641) in terms of harmonic oscillator coordinates for both the photons and 
the phonons. Solve it as a coupled oscillator problem. 


22. Solve the polariton Hamiltonian (4.688) from the exciton viewpoint. Calculate the self-energies of 
the exciton, including those arising from the interaction with photons. You may simplify the problem by 
neglecting the local field term. Your answer should have the poles of the exciton Green’s function at the 
polariton modes. 


Chapter 5 
Homogeneous Electron Gas 


The use of diagram techniques in many-particle physics began in the early 1950s, soon after 
their introduction into field theory. Although these methods were applied to a variety of 
problems, some areas of work were more successful than others. The two areas which enjoyed 
early success were the homogeneous electron gas and the polaron problem. Later there were 
other successes such as the theories of superconductivity and superfluidity. However, the 
theory of the homogeneous electron gas, as it was initially understood, was worked out by 
many contributors during the period 1957-1958. They brought a variety of theoretical 
approaches to this problem, but all used diagrammatic techniques in some form. On the other 
hand, during the past twenty years, there has been achieved an understanding of electron- 
electron interactions in strongly correlated metals. This latter topic is covered in the next 
chapter. 


5.1. EXCHANGE AND CORRELATION 


The homogeneous electron gas is described by the Hamiltonian 


H = E &pChoCoo +5 > 5 vC iao Ck'-q,0'Ck,o' CKO 
po 


V kk'oo' q40 
2 
P 
Ep = Pm (5.1) 
Ane* 
Ug = ge (5.2) 


which was derived in (1.164). The electrons are free particles, which mutually interact by 
Coulomb’s law e?/r. There are N, electrons in a large volume v, with an average density 
Ny = N,/v. A positive charge of density ng is spread uniformly through the volume v. The 
positive background maintains the overall charge neutrality of the system. The homogeneous 
electron gas is also called the jellium model of a solid. 

There are two electronic properties which will be evaluated in this section. First is the 
self-energy of an electron of momentum p. Second, the properties of all the electrons will be 
averaged to obtain the total energy of the system. This average is done at zero temperature, 
which derives the ground state energy. The important quantity is the ground state energy per 
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particle E,, which can depend only on the particle density E,(no). The total energy of the N,- 
particle system is just N,E, = Ep, since surface effects are ignored. 
The parameter r, is universally used to describe the density of an electron gas, 


3 
“tots p=] (5.3) 


where dy is the Bohr radius. In an electron gas with uniform density nọ, r, is the radius in 
atomic units of the sphere which encloses one unit of electron charge. Thus r, is small for a 
high-density electron gas and it is large for a low-density gas. Other properties of an electron 
gas may be expressed in terms of this parameter. The density may be related to the Fermi 
wave vector, 


dp | f >, k 
0 i p n2 0 P ap ( ) 


so that the Fermi wave vector and energy are related to r,, 


On\ "7 (dan NP /9n\'71 1.9192 
ranean (A-C 


h2k2 h2 3.6832 i À (65:5) 
F 2 ° 
Er = gp T AFT) (iz) =“ En 


2 
Fs 


where E,,, = 13.60 eV will be the standard unit of energy. Similarly, the plasma frequency is 


Aren) AN 3.4641 


rs 


In the homogeneous electron gas, the average kinetic energy of the electrons is going to be 
proportional to Ep ~% (K.E.) ~ kž, which, by dimensional analysis, is inversely proportional 
to the square of the characteristic length of the system, which is r,. Therefore (K.E.) œ 1/72. 
Similarly, dimensional analysis suggests that the average Coulomb energy per particle will be 
e? divided by the characteristic length, or (P.E.) x 1/r,. When the electron gas has suffi- 
ciently high density, which is small 7,, the kinetic energy term will be larger than the potential 
energy term. In this case, the electrons will behave as free particles, since the potential energy 
is a perturbation on the dominant kinetic energy. In the high-density limit, the free-particle 
picture is expected to be valid. This limiting case will be investigated below. The kinetic 
energy and potential energy contributions will be calculated. The potential energy terms 
cannot be found exactly, but the result can be expressed as a power series—with logarithm 
terms—in the parameter 7,. This series should be accurate at small values of r,. A term-by- 
term derivation of this series is derived below. Later, in Sec. 5.2, the other limit of low density 
is considered where the potential energy is larger than the kinetic energy. 
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5.1.1. Kinetic Energy 


The first energy term is the kinetic energy. For a single particle it is e, = k*/2m. The 
contribution to the ground state energy is obtained by summing over all the particles in the 
ground state: 


(2 ) 2m P no 2n2m 0 
3 hk? 
=55 Ne =iE,N, (5.7) 
2.2099 
2 Ep = r2 Ey (5.8) 


The average kinetic energy is 2E 7, Which is given in terms of r,. 


5.1.2. Hartree 


All the remaining terms in the energy come from the Coulomb interaction between the 
particles. This contribution has not been evaluated exactly. Instead, approximate expressions 
are obtained by a variety of means. Our derivation follows that of Gell-Mann and Brueckner 
(1957). They start by examining the terms generated by ordinary perturbation theory. The first 
term which occurs is the Coulomb interaction between the electrons and the uniform positive 
background, which is called the Hartree interaction. In the model of the homogeneous 
electron gas, the time-averaged electron density is uniform throughout the system, as is the 
positive background. These equal and opposite charge densities exactly cancel, so that the net 
system is charge neutral. The Hartree energy is zero. That is, this energy is given by the 
equation 


N,£o = 


2 frd 
[er — pæ Dlip.) — p; (5.9) 


2 J |r; — r| 


but the ion and electron particle densities are p, = p; = No, and the contribution is zero. This 
fact has already been used in writing Eq. (5.1) by the omission of the q = 0 term from the 
Coulomb interaction. The q = 0 term is the direct Coulomb interaction among the electrons. 
This term is omitted because it is canceled by the direct interaction with the positive back- 
ground. 


5.1.3. Exchange 


The Coulomb interactions in (5.1) provide other energy contributions in addition to the 
direct term. The Hartree term corresponds to the following pairing of the operators in (5.1) 
when the average value of H is evaluated: 


> (Cha, Cigo Cro Cko) ~ Sg=0 (eer 2 (CÈ y Cro) 
kk’ oo’ ko k'o’ 
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Another way to pair the same operators is 


» (Chg. oC —q,o’ Cy oCko) © ~ » (Chig oCo MCE ao Cko) 
kk’oo’ kk’co’ 


=- 2 Mk+q”k 
ko 


This arrangement requires that o’ = o and k’ = k + q. The minus sign is from exchanging 
the order of the fermion operators. This term is called the exchange energy, or the Fock 
energy. Retaining both terms is called Hartree-Fock. The exchange term was derived 
previously in (2.138). It gives a contribution to the energy of an individual electron, as well as 
a contribution to the ground state energy of the collection of electrons: 


1 
E(k) = => E vgra (5.11) 
q 
] 
Eg = zy L MÈ (5.12) 


The self-energy &,(k) depends only on the magnitude of the wave vector k of the particle and 
not upon its energy variable ip, or œ. The wave vector integrals are elementary (p = k + q, 
v = cos 8): 


Ëp Ane 


2. (k)=-— — n 5.13 
© Jep re” 613) 
er , fi dv 
=- d c 5.14 
T l, P P| k? + p? — 2pkv (5-14) 
e? (4r k+p 
— £ L P 5.1 
a ax (5:13) 
l| — l 
£ (k) = (1 a nf =) (5.16) 
k 
Kr 
A particle at the Fermi energy k = kp has y = 1 and 
2 
zke) = -E (5.18) 
It is convenient to write 
ekr 
z, = —F iy) (5.19) 
l — 1 
S0) = ( +e nf i) (5.20) 


where S(y) is a function which gives the wave vector dependence of the exchange energy. 
This function is shown in Fig. 5.1. Its value at y = 0 is S$ = —2. In the vicinity of y = 1 it 
rises steeply and approaches zero at large values of y. At the Fermi energy the value is 
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#4: k/k; 


0.5 1.0 LS 2.0 


2 
S( )=,(6 /(e°h_/7) 
ò 


N 
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FIGURE 5.1 The unscreened exchange energy of the electron gas. 


S(1) = —1. This point is interesting because the function S(y) has an infinite slope. The 


derivative of S(y) is 
1 1 
a) = = (= +y) inf =. 2) (5.21) 
y l -y 
The derivative has a logarithmic divergence as y —> 1. This fact is interesting because it 


predicts that the effective mass is zero. The effective mass of a particle was defined in (3.160). 
Since the exchange self-energy is not frequency dependent, the effective mass is given by 


(=) =| HELW (5.22) 
m 
_ em H 
= Tk dy S) (5.23) 
_ em 1 (1t+y*, |l+y 
=e (= inf >| -2) (5.24) 


which diverges at the Fermi energy y — 1. If the inverse effective mass really diverged at the 
Fermi surface, it would have several observable consequences. The electron gas would be 
unstable at low temperatures, and the specific heat would diverge. Many metals have unusual 
properties at low temperatures, e.g., some become superconducting, while others become 
magnetic. However, simple metals such as the alkalis are stable at low temperatures, so that 
this exchange instability is regarded as being absent. In fact, an examination of further terms 
in the perturbation theory produces another divergence in the effective mass which exactly 
cancels the one due to exchange. The effective mass and specific heat are not divergent. This 
subject is pursued further in Sec. 5.8. 

The exchange energy contribution to the ground state energy is obtained from (5.12). 
Summing over spins gives an expression which is easy to evaluate 


ak 
a zk) =— maton m2 (0 
k? (ekp l +y 
Tej (1n res z) 
Ey = 3" (5.25) 


The average exchange energy per electron is 5 of the value at the Fermi energy. The additional 
factor of 5 enters the ground state energy to account for the fact that the exchange energy is a 
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pair interaction. In terms of the parameter r, the total ground state exchange energy per 
electron is 


2 0.91 
-)=- 9163 (5.26) 


3 
E,=-— — 
gx Mn (kr ay) (= 


Fs 


where the earlier result for (krao) was used in the final expression. 
So far two terms have been found for the energy of the particle, 


272 
E(k) = E +E (k +-- (5.27) 


The corresponding two terms for the ground state energy per particle are, 
2.2099 0.9163 
— + 


2 
Fs Fs 


Eg = 


(5.28) 


The ground state energy has the appearance of a power series, in increasing powers of r,. 


Although it is usually unsafe to extrapolate from just two terms, in fact E, is a series inr,. The 


next term will be of order O(r°) . The zeroth power could be interpreted as either a constant or 
as In(r,). In fact, both of these terms are present. The series has the form 


2.2099 0.9163 


E, = — 0.094 + 0.0622 In(r,) + --- (5.29) 
r; r 


S 


where the last two terms were first given by Gell-Mann and Brueckner (1957). This result has 
since been obtained by a variety of perturbation techniques. The Gell-Mann and Brueckner 
derivation was by Rayleigh-Schrödinger perturbation theory. Alternate derivations have been 
presented by Sawada et al. (1957), Hubbard (1957), Nozieres and Pines (1958), and Quinn 
and Ferrell (1958). Our derivation follows Quinn and Ferrell. 

The above energy terms comprise the Hartree-Fock theory. It is defined to be the kinetic 
energy, the Hartree energy which is zero, and the exchange energy. Wigner and Seitz (1933, 
1934) suggested the name correlation energy to mean the energy terms beyond Hartree-Fock. 
The name is applied both to the additional energy terms in the self-energy of an electron of 
wave vector k, and to the ground state energy obtained by averaging over all of the electrons. 
The total ground state energy per particle is written as 


2.2099 0.9163 
— + 


2 
Fs Fs 


E; = 


E, (5.30) 


where the correlation energy E, needs to be determined. Of course, some terms are known for 
its power series in 7,: 


E, = —0.094 + 0.0622 In(r,) + O(r,) (5.31) 


This result is accurate in the limit of r, —> 0. There is some uncertainty regarding the 
maximum value of r, for which these few terms provide an accurate description. The radius of 
convergence of the power series is about r, < 1. Actual metals have values of r, up to about 6, 
and this series does not give sensible numbers at these low densities. 

The term correlation energy is often applied to other quantities besides the total ground 
state energy. For example, the correlation energy of a particle of wave vector k are those terms 
beyond Hartree-Fock: 

hk? 


E(k) =~ + E(k) + Elk, thy) (5.32) 
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The self-energy from correlation 2 ,(k, ik,,) depends upon the particle energy ik,. The energy 
and wave vector can be averaged to obtain the contribution £, to the ground state correlation 
energy. 


5.1.4. Seitz’s Theorem 


The derivation of Quinn and Ferrell (1958) does calculate the correlation energy per 
particle. They then average over electron states to obtain the correlation contribution to the 
ground state energy. This averaging is based on a theorem of Seitz (1940) which relates the 
ground state energy to the chemical potential. The chemical potential is defined as the energy 
it takes to add or remove an electron from the material. It is the energy which divides the 
empty from the occupied states at zero temperature. Of course, it is just the Fermi energy of 
the metal. The chemical potential is the energy of an electron of momentum kp: 

h2k2 
b= + È (kr) + Re &(kp, 0) (5.33) 
where ik, = 0 is the chemical potential. The chemical potential u is only a function of the 
density of the electron ny. The theorem of Seitz is that 


d dE, 
u(no) = dn, [ME (%)| = E; + no any (5.34) 


The proof is based on the definition of the chemical potential. It is the energy difference 
between a system with N, particles and one with N, + 1: 


u = E7(N, + 1) — ETN.) (5.35) 


For an N,-particle system, the total energy is Er = N,E,. For fixed volume v, an (N, + 1)- 
particle system has a density (1/v)(N, + 1) = no + 1/v. The total energy of the (N, + 1)- 
particle system is 


| dE, 
E(N, + 1) = (N; + DE, (m9 + 1/v) = (N. + (Eglo) + i) 


dE, (1 
= N.E, + E,(no) -+ no = + Oj - (5.36) 
dno v 


The difference between this expression and N,E, is the chemical potential and gives the 
assertion in (5.34). In proving this theorem, the volume v is kept fixed, as is the amount of 
positive charge. The (N, + 1)-particle system has a slight charge imbalance, since it has one 
more unit of electron charge than positive charge. However, the extra charge is a negligible 
contribution to the energy. For example, if a body of average dimension L is uniformly 
charged with one unit of charge, then its Coulomb energy is of order e?/L. But L for a 
macroscopic body is L % 1cm, and e”/L is 10~’ eV, which is negligible. For very small 
particles, where the dimension L is nanometers, the charging energy e?/L is important. 

The chemical potential is the negative of the work function. It is the energy required to 
remove an electron from the solid and take it to infinity with zero kinetic energy. If this 
vacuum level is called zero, then the measured work function just gives the negative of the 
value of the chemical potential. However, there is a surface correction to the work function, or 
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chemical potential, but not to the volume part of the ground state energy per particle. That is, 
write the total energy as 


Er = N.Eg + AE, (5.37) 


where A is the total surface area and E, is the energy per unit surface area. For macroscopic 
bodies, £, does not depend on the surface area. However, p does have a term which depends 
on the surface—actually on the surface dipole layer. Write it as y = ug + Ap, where Ap is the 
contribution from the surface dipole layer and ug is from the bulk terms. The surface 
contribution Ap to the work function was first recognized by Bardeen (1936) and calculated 
accurately by Lang and Kohn (1971). The theorem of Seitz actually just refers to up. 

The relationship (5.34) is valid term by term in the energy series. The Hartree-Fock 
terms may be expressed in powers of no: 


2.2099 0.9163 


2/3 1/3 
Eg, HF = r2 z ` 3 Xn -4Y no! 
2.2099 3E 
sAm =a gg, = SOn (5.38) 
0.9163 3 
3 Yn” = =z (3n nag)” 


Applying the theorem for the chemical potential gives 


d 2/3 1/3 ekp 
HB, HF = dn, “0h e.HF) = Xn — Yny = Er — n (5.39) 


The theorem correctly gives that the average kinetic energy is 2 of the value at the Fermi 
energy, and the average exchange energy is : the value at the Fermi energy. These results 
follow from their respective powers of nọ: $ and 3. 

The correlation energy for the ground state is a series in r, or no >’ The theorem may be 


applied to this series on a term-by-term fashion: 


E.=A+Blin(r,) + Cr, +- 


Anna? 3 1/3 
—A—1B1 oo} + ce ——~ a 5.40 
3 n( 3. jt Arn ay + (5-40) 


d 
Ho = m EO =Á —}B+Bin(r,) +4Cr, +> (5.41) 


Quinn and Ferrell used this theorem to make a backward deduction. They first found p, 
by calculating the self-energy terms of an electron at the Fermi energy. Then the above 
relationships may be used to find the correlation energy for the ground state energy. The 
logarithm term has the same coefficient B for the two energy terms. Once this term is 
determined, the constant term for the ground state energy is obtained by adding B/3 to the 
constant term found for the chemical potential. 

The self-energy of an electron, from Coulomb interactions, may be calculated by the 
methods described in Chapters 2 and 3. The self-energy function has an infinite number of 
terms. The basic procedure is to start examining some low-order terms and to deduce which 
terms contribute to the constant and In(7,) terms. The term low-order refers to the order of the 
self-energy diagram, which is the number of internal Coulomb lines. 
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5.1.5. 5%) 


The exchange energy graph is shown as the first diagram in Fig. 5.2. It is the only 
contribution with one Coulomb line. The correlation energy is the sum of all contributions 
with two or more Coulomb lines. There are three diagrams, or self-energy terms, with two 
Coulomb lines. One of these is shown in Fig. 5.2(a). By using the rules for constructing 
diagrams in Sec. 3.4, this self-energy contribution is 


1 

ICOK) =L gy LIME + DIME + DINK +9449) 
vB“ ag iqniqy 

The first summation over frequencies is given in (3.217): 


I GOK 4 g)9(k +q+q)= Nr (Sk+q) = Nr(Sk+q+q’) 


3 (5.42) 
iq, ly + ok+q g Ek+q+g 


The second summation requires a new derivation, since one energy denominator has odd 
multiples of ix/B, and the other has even multiples. The answer is 


Lo kta) L Paerata — beta) + tr Eneg) 


Big idw + Ek+q — Sk-+q+q’ 7 ik, + Sk-+q+q/ -7 kta 7 Ck+q 
where one occupation function is a boson distribution and the other fermion. The boson 
distribution may be eliminated by using the identity 
[Ar(Sk+q) — Ne (Sk+qtq’)a(Sk+9-+4/ _ k+) = nr (Sk+q+q LI! = Nr(Sx+q)] 
which gives the final self-energy term: 
l V Uy 

ECNE) = =ý a II 
v? qq’ Kn + Cktatq’ — k+ = k+q 
x {nr (Sk+q lr kta) — MF Sk+a+q’)I 
+ nr (Čk+r+g)l] 7 Nr(Sxiq)I} (5.43) 


The self-energy of a particle is needed on the Fermi surface. Set k = kp and ik, = 
Ske = kz/2m — u = 0. The terms in the energy denominator largely cancel, 
qq 

m 


Ék + ktat ~~ Ck+q -7 k+ = (5.44) 


FIGURE 5.2 
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The self-energy is 
eym(Pq{[dq 1 
TCD 0 =- Zom za] q 
x {nr (Sxrq [nr (6k+a) — nr Éra] t (5.45) 


This integral is evaluated by using dimensionless units. All wave vectors are normalized to the 
Fermi wave vector x = q/kp, y = q'/kp, 2 = Kp /kp: 


E,, (dxf dy 1 
¥ (2a) — _ 33, |S |S A A _ A 
Iní x2 y2 x+y {np (2 + y)[np(Z + x) np(Z +X + y)] 


+n-(z+x+y)[l -— nr + x))} (5.46) 


The quantity on the right is independent of electron density. The integral is convergent and 
gives a nonzero result. It contributes to the constant term A in (5.41). Since Seitz’s theorem is 
true for each term, this integral must equal the same constant as found for the equivalent term 
for E.. The similar contribution to E, was evaluated by Onsager et al. (1966) and found to be 
=? = Lin(2) — (3/2n?)C(3) = 0.0436. 


5.1.6, 2?) 


The second self-energy term involving two Coulomb lines is shown in Fig. 5.2(b). This 
contribution is omitted because it is part of a sum of diagrams which gives zero. Consider the 
summation shown in Fig. 5.3(a). Each term has one more exchange diagram. All terms may 
be summed by evaluating the exchange energy with an electron Green’s function in the self- 
energy which includes the exchange energy. This summation is given by the self-energy 


1 
È (k) = -B 2 074k +q, ik,,) (5.47) 
q,ik, 
1 
ik, — Ëk+q — 2.(k + q) 
where the Green’s function has a self-energy due to exchange. Since the self-energy 2,(k) 


does not depend on frequency, the frequency summation of the Green’s function yields the 
simple number operator as in (3.219), 


G(k + q, ik,) = (5.48) 


ji , l 
pZ IE +a ik) = nra tZk + Ol e 


FIGURE 5.3 
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and the self-energy is 
l 1 
2 (k) =- V » UMrlSk+q + 2,(k + q)] => v >, Vp-k”Flőóp + 2.(p)] (5.49) 
q p 


At zero temperature the electron distribution function np[6p + 2,(p)] is just a step function 
which equals unity for electrons beneath the chemical potential and zero for those above. This 
step function must also be normalized so that the electron density is still nọ, as in 


d 
m = Mnr, +EP) (5.50) 


The effect of the exchange energy 2,(p) in the argument of np is to change the chemical 
potential. However, the Fermi wave vector kp is the same, even after the exchange energy has 
been included in the occupation. function. The Fermi wave vector determines the density no, 
which is unchanged by the interactions. The addition of the exchange energy must be 
canceled by an equal change in chemical potential. The result is that at zero temperature, 


lim nplés + 2,00] = Okr — k) (5.51) 


At zero temperature the Fermi distribution is still a step function at k = kp. The exchange 
energy &(k) in (5.49) is exactly equal to the one calculated earlier. In Fig. 5.3(a), the 
summation of all terms just yields the value of the first term alone. All of the subsequent 
terms in that series sum to zero, because of a shift in the chemical potential. 

The evaluation of these self-energy terms has become more subtle. It is not just a 
straightforward evaluation of terms in a series. One must understand each term in order to 
gauge its actual contribution. Further examples of this behavior are found in the other second- 
order term. 

The exchange self-energy has a remarkable effect upon the zero temperature electron 
distribution. It leaves kp unchanged. This simple result is a consequence of the feature that 
x(x) is independent of frequency. Most self-energy terms depend on frequency and so have a 
larger effect upon the wave vector distributions: recall Problem 4 in Chapter 3. 


5.1.7. Z2 


The third self-energy with two Coulomb lines is shown in Fig. 5.2(c). This self-energy 
diagram may be written as 


1 
Z IK) = — -5 D gP”, idm) GOUK + q, ik, + ign) (5.52) 
qian 


The closed fermion loop gives the polarization diagram P‘)(q, ig,,). Although one could 
proceed with the evaluation of this self-energy, it obviously has one drawback. The wave 
vector integral appears to diverge at small values of q, when v —> oo, because of the factor vs 


d d 
[3>] Fo (5.53) 


Indeed it does diverge, so this self-energy term is infinite. The divergence is removed by 
summing a series of self-energy diagrams. This series is shown in Fig. 5.3(b). Each term has 
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one more polarization bubble and one more Coulomb line—hence an additional factor of 
v,P®. The summation of these terms gives the simple series 


1 
TEV) = — 8 Eog Zk +q) 
q 


x {v,PO(g) + [v,PO@P + W POOP ++} 
vP O 
vaP™® (g) 


The summation of terms shown in Fig. 5.3 is called the random phase approximation (RPA). 
The “approximation” in RPA is that this series of terms is used to represent the entire answer. 
For example, the RPA result for the electron correlation energy uses Fig. 5.3(b) or (5.54). RPA 
ignores other terms, such as (5.46), which was found from the diagram in Fig. 5.2(a). RPA 
ignores many more terms in each higher order of perturbation series. 

The denominator in (5.54) is a very important quantity because it is very often used in 
calculations. It is the RPA approximation to the dielectric function and is defined as 


Erra (d) = 1 — vP (Q) (5.55) 


= -JE Gk + D5 (5.54) 


The self-energy series may be written as 


1 1 
ECNE = —— Y v, gk + | — 1 (5.56) 
vB 2 kta Eppa (g) 
The “—1” term in the brackets is easy to evaluate. It is 
1 1 
9M +a) = E vyg = È, (k) (5.57) 
VB “7 Vig 


which is just the negative of the exchange energy. Adding the exchange energy to the above 
result gives a term containing the RPA dielectric function: 


(0) 
Lapa(g) = LO%(k) + E(k) = — JE? v, FP (k +q) 


58 
q ~ Eppa(q) (5.58) 


The summation of these two terms is defined as the RPA self-energy. It is frequently used to 
evaluate the self-energy of the electron (see Lundqvist, 1969). It will be discussed in Sec. 5.8. 
The exchange energy is logically included as the first term in the series shown in Fig. 5.3(b). 
The RPA self-energy, defined above, is also called the screened exchange energy. This self- 
energy has one feature which is worth mentioning now, although it will be shown later in Sec. 
5.8: the inverse effective mass no longer diverges at the Fermi surface. A detailed evaluation 
of this self-energy shows that the coefficient B = (2/n”)(1 — In 2) = 0.0622. 


5.1.8. High-Density Limit 
The first two terms in the series for the ground state energy in the RPA are 


E, gpa = —0.142 + 0.0622 Inr, + O(r,, r, In(r,)) (5.59) 
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Adding the Onsager result, for the second-order diagram =” [see (5.46)], gives the result of 
Gell-Mann and Brueckner: 


E, = —0.094 + 0.0622 Inr, +--+ (5.60) 
Another term in the series has been obtained by Carr and Maradudin (1964): 
E, = —0.094 + 0.0622 Inr, + 0.0187, Inr, + ar, + O(77) (5.61) 


They were unable to obtain all the terms contributing to the coefficient a of the linear term in 
r,. They obtained all its contributions except one, and they were able to put limits on its value. 

Carr and Maradudin also discussed the convergence of this series at large values of r,. 
Because of the term In r,, this series is not analytic as r, —> 0, which is the limit of very high 
densities. But the series is accurate in this limit. Metallic densities are roughly 1.8 < r, < 6. It 
is an important question whether the formula (5.61) is valid for these values of r,. Figure 5.4 
shows a plot of the correlation energy found by Carr and Maradudin. Only the terms shown in 
(5.61) have been plotted (a = 0). It diverges at small r,, which is just the effect of the In(r,) 
term. It is negative up to values about r, œ~ 2.5, where it crosses the axis and becomes 
positive. Now there are simple arguments which indicate that the correlation energy must be 
negative. These arguments are given below in great detail. They show that the perturbation 
formula for E, is not valid in the range of metallic densities. 

Large values of r, correspond to the low-density electron gas. Wigner (1934) showed 
that the low-density electron gas behaved very differently from the high-density electron gas. 
At low density, the electrons become localized, and the calculation of the correlation energy 
becomes quite different. The low-density limit of Wigner is discussed in the next section. The 
correlation energy is determined for all values of r, by extrapolation procedures between the 
low- and high-density limits. These extrapolation procedures provide a formula for the 
correlation energy which is valid in the region of metallic densities. 
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FIGURE 5.4 The correlation energy of the electron gas as given by the expansion of Carr and Maradudin. 
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5.1.9. Pair Distribution Function 


The correlation energy is the improvement in the ground state energy beyond the 
Hartree-Fock approximation. The correlation energy may be understood, at least concep- 
tually, only after understanding Hartree—Fock. It is useful to examine an important property of 
the Hartree—Fock theory: the pair distribution function g(r), which was introduced in Sec. 
1.6. The pair distribution function g(r) is the probability per unit volume that an electron is at 
r if there is already one at r = 0. It depends on spin, so there are two different pair distribution 
functions in an unmagnetized electron gas: g,,(r) =g, (r) and gy\(r) =g,,(v). In the 
notation g,.(r) the first spin index s is for the central electron, while the second s’ is for the 
electrons at r. Thus g,,(7) is the probability that a spin-up electron is at r if a spin-down 
electron is at r= 0. Of course, the electrons are not fixed, and they are usually moving 
rapidly. These pair distribution functions are averages for the moving particles. Even the 
electron at r = 0 is not fixed, so that this reference point moves with the electron. 

The pair distribution functions are rather easy to calculate in the Hartree-Fock 
approximation. Here the N-particle wave function is a Slater determinant, 


pa (“1) pa, (2) a 4, (tw) 


pa (71) 4, (72) oa 4, (tw) 


l 
Fina rn) = JN (5.62) 


pa, Cri) pa, (2) n Pa, (Fy) 


where the À, are the quantum numbers which describe the states, and there is one wave 
function for every occupied electron state. The square of this many-particle wave function is 
the N-particle density matrix. The pair distribution function is given by the two-particle 
density matrix. It is obtained by integrating the N-particle density over all but two space 
coordinates. A similar expectation value is taken over the spin variables: 


Ess (T1, r2) =v [drr n Pryl Paal n ry)l? (5.63) 


If the one-electron orbitals Oy, (r) are assumed orthogonal, then this integration just yields the 
sum over all possible pair wave functions: 


2 2 
V 
Zss (f1, r2) = NNA 


pCi) dy (72) 


PD Para) (5.64) 


The sum over A;, À; is over all occupied states, so each pair is summed twice. For the 
homogeneous electron gas, the orbitals must describe plane waves, 


b(n) = ee (5.65) 
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where the x, are the spin functions. The pair distribution function just describes the orbital 
part, so that one averages over spin functions. These spin averages are (¥,X%+) = 1, 
(X4% 1) = 0, etc. The two pair distribution functions are 


Zy (ri — r3) = NND (le™ *r, tik, 'r» |? + jek ‘ry tik, +r, (2) 
152 
2 N\? 
“NND (5) =5 (5.66) 
gmi — r2) = w= YO fel tier elitr |? 
— kk, 
=e ey 
2 
=5[1 — AQ) —1)7] (5.67) 
2 
A(r) = -5 er (5.68) 
Nk 


The summation over k runs only over occupied states. The antiparallel spin distribution 
function g}, = g); =z in the Hartree-Fock approximation. The cross term which results 
from the determinant 1 is zero because of the orthogonality of the two spin functions. There is 
no correlation in the position of electrons of opposite spin. The parallel spin g,, = g,, has a 
definite spatial dependence, which comes from the cross term which is retained since the 
spins are parallel and the spin averages are unity. The function A(r) is calculated to be 


2 fk ay 3 [E 
n(n) = = | Some =f, kdk sin(kr) 


3. 
~ hep! (rkr) (5.69) 


where j; (x) is a spherical Bessel function. The two pair distribution functions are shown as the 
solid lines in Fig. 5.5. The parallel spin g,,(r) vanishes at r = 0 and approaches 5 at large 
distances. The pair distribution function g,,(7) must vanish at r = 0 as a consequence of the 
exclusion principle for fermions. This feature is built into the many-particle wave function 
(5.62): if any pair of orbitals have the same spin and position, these two rows of the deter- 
minant are identical, which makes the determinant vanish. 


FIGURE 5.5 The pair distribution functions for parallel (left) and antiparallel (right) spins. The solid liine is the 
Hartree-Fock approximation, while the dashed line includes correlation. 
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The total pair distribution function for a particle is the combination of the result for 
parallel and antiparallel spin distributions: 


g(r) = 24%) +g C) (5.70) 


In a nonmagnetic system, the same result is obtained if the central particle has spin down. The 
pair distribution function has two features which are important for our discussion. First, there 
is the normalization integral: 


m | riso) —- 1] = -1 (5.71) 


The second is a statement about the ground state energy of the electron gas. It has kinetic 
energy plus Coulomb parts. The Coulomb energy around the hole in the Hartree-Fock 
approximation is 


en r)— 1 
Ecou = m fas P (5.72) 


The integral Ecou 1s related to the exchange energy which is discussed in Sec. 5.6.3. The 
above two relations for g(r) can be checked in the Hartree-Fock approximation. In this case 
g(r) = 4 + gy (r). The normalization integral yields (x = rkp) 


n 6(~ | 
Ng | trene) -3 = -2 [arao = -$f dxj,(x)’ = -1 


while the Coulomb integral yields 


en, { dr en, [r 
Eo =F" Feno- -e ao 
OO . 2 2 
= - ek | dy HOY 3e kr (5.73) 
T 0 X Ar 


These results are correct for Hartree—Fock. It is the same exchange energy per particle which 
was derived by diagrammatic means. 

The pair distribution function g(r) is the distribution of electrons, on the average, about 
any electron. It goes to unity at large distances, which is a result of the uniform distribution no 
of electron charge. That is, the electron charge density is —enpg(r). Since g(r) is less than 
unity near r — 0, the electron charge is depleted in the vicinity of the electron. This reduction 
may be viewed as a hole in the electron density. Wigner suggested the name exchange and 
correlation hole. The hole moves with the electron. 

According to the normalization (5.71), the total charge missing from this hole is one 
electron charge. The factor —n)(g — 1) is the density of hole charge, which integrates to 1. 
The sum rule is really a statement of charge neutrality. The hole has a positive charge density, 
since it is the absence of electrons. The integral of this charge density must be e. Each 
electron of charge —e has in its hole the amount of charge e, and the system is neutral. The 
homogeneous electron gas has a uniform charge density only on the average. To a particular 
electron, the system is not uniform at all, since other electrons are not as likely to venture near 
to it as they are to other points. 

The Coulomb integral gives the potential energy of the system. The electron interacts 
with its own hole charge. It is not influenced by the electrons or uniform charge density 
farther away, since they cancel on the average. Near the electron the positive background 
charge is not canceled by the other electrons, since the other electrons are not as likely to be 
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nearby. In the Hartree-Fock approximation, only electrons of parallel spin make the exchange 
and correlation hole. Since g}, = 4, the antiparallel spins are not affected. 

It is possible to arrange g(r) so that the sum rule (5.71) is still obeyed, yet the potential 
energy is lower (i.e., a larger negative value) than found in Hartree-Fock. This lowering is 
done by permitting g, | (r) to become less than 5 near the point r — 0, as shown by the dashed 
line in Fig. 5.5. There is no particular reason it must go to zero. The only fixed rule is that it. 
may not be negative, since it is a probability. Since the total hole charge must be unity, the 
change in g}, (r) must be accompanied by a change in g,,(7) which causes it to rise more 
steeply with r, as shown by the dashed line in Fig. 5.5. It is these changes which give rise to 
the correlation energy. Changes in g}, and g,, will usually cost some kinetic energy, so that 
one cannot just adjust g,,, to maximize the potential energy alone. The system will seek the 
lowest energy state at T = 0, which is the state with some “correlation” between the motion 
of electrons with antiparallel spins. This “correlation” between the motion of pairs of elec- 
trons is what gives rise to the “correlation energy.” 


5.2. WIGNER LATTICE 


Actual electron systems exist over a range of densities. The range in metals is 
approximately 1.8 < r, < 6. Electron densities inside atoms are also quite variable. It is 
necessary to have a description of the correlation energy which is valid for a distribution of 
densities. In the previous section, a formula was derived for the high-density limit of a 
homogeneous electron gas. The next step is to derive the result for the low-density limit. The 
final formulas are found by interpolating between these extremes. This procedure was first 
carried through by Wigner (1934), and his formula’ 


0.88 


= ——___ 14 
© r +78 (5:74) 


is still widely used. Recent interpolation formulas are only slightly better. At low densities, 
Wigner speculated that the electrons would become localized and form a regular lattice. He 
used the jellium model where the positive charge is uniformly spread through the system. The 
electron lattice would presumably be a close-packed structure such as bce, fcc, or hcp, in 
which electrons would vibrate around their equilibrium positions. There would be vibrational 
modes of the electrons, and they would be at the plasmon frequency. At very low density the 
potential energy becomes more important than the kinetic energy. The kinetic energy is in the 
zero-point motion of the vibrational modes. Localization cannot occur until the zero point 
energy is less than the potential energy. 

Wigner calculated the potential energy in the following fashion. A Wigner—Seitz model 
was taken for the unit cell of the lattice. It is a sphere of radius r,aọ, with the electron at the 
center. Each sphere has overall neutrality, since the one-electron charge at the center is 
canceled by the positive charge inside the volume of the sphere. For a sphere of radius 7,ao, 
there is one unit of charge, which follows from the definition of 7,. Outside of each sphere, the 
electric field is zero. If all unit cells are neutral spheres, then they exert no electric fields on 
each other. The electric fields inside a sphere arise only from the electron and positive charge 
within that sphere. Of course this model is only approximate, since the unit cells are not truly 
spheres. Spheres cannot be packed together to cover all volume. The error made by the 
Wigner—Seitz approximation is remarkably small. 


' Wigner’s original paper contained an error. The corrected result is given. 
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The first term is the potential energy between the electron and the uniform positive 


background. It is 
Fsaq 
Eep = Je- a = -35,| rdr 


__ 2e =- (5) (5.75) 


where —e’/r is the potential energy and ny = 3/(4nr?aĵ) is the density of positive charge at 
each distance r. The result is —3/r, in atomic units, which is a large energy term. 

The second term in the potential energy is the interaction of the positive charge with 
itself. The potential energy V(r) from the positive charge at a distance r from the center is 
obtained by solving first for its equivalent electric field, 


dV e (=) E er 


——=eK(r)= 720 3 = (5.76) 


33 
dr Vr; A 


which is e?/r? times the total charge within the sphere of radius r. Integrating this equation to 
obtain V(r), there is a constant of integration. It is determined by the condition that the total 
potential, from electron and positive charge, must vanish at the surface of the sphere. The 
constant is chosen to make V(r) be e*/r,ap at the surface. The result is 


e r \? 1 r \? 
2r ao f g (=) | E r, £ E (=) lz 6-77) 


The potential energy of the positive charge interacting with itself is found using 


1 3 et P% r \? 
E,, =~|a@rV =2—— | rdr|3- 
PP Al ry rno 4 (r a) l | y 


3 e? 6 
Sry «Sr, |” 


V(r) = 


(5.78) 


The result is multiplied by 4 because it is a self-energy. The final result is 1.2/r, in atomic 
units. There are only two potential energy terms. The interaction of the electron with itself is 
not included. Aside from the fact that it is infinity, it does not change in the metal and so does 
not contribute to the cohesive energy of the system. 

The sum of these two terms is —1.8/r,. It is the total potential energy in the Wigner 
lattice in the Wigner—Seitz approximation. It is a large term, e.g., it has a larger coefficient 
than the exchange and correlation energies which were found for the free-particle system. The 
system apparently has gained energy by the localization of the electrons. 

The actual energy of several Wigner lattices was calculated by Sholl (1967) using 
Madelung summation methods. He found the energy of a lattice of point charges in a uniform 
positive background. He expressed his results as —A/r,, where the parameter A was found for 
several lattices: 


Lattice A 

SC 1.760 
fcc 1.79175 
bec 1.79186 


hcp 1.79168 
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The sc lattice has a unit cell which is not very spherical, and its A is different. But the 
other lattices are more close-packed, and the coefficient 1.792 is remarkably close to the 
Wigner-—Seitz value of 1.8. The latter approximation errs by less than 5%. 

Is the Wigner lattice stable? To answer that question, first consider the potential energy 
on an individual electron in the vicinity of its equilibrium point. In the Wigner—Seitz model, it 
is just the negative of the potential energy from the positive charge: 


2 
V, = -V (r) = -Ż f — (; —) |e (5.79) 


At r = 0 it equals the prior result —3/r,. Away from r = 0 the potential increases quad- 
ratically. If each electron moved independently of the others, then the electron would have 
harmonic vibrations about the equilibrium point. The zero-point motion of these vibrations is 
another term in the ground state energy, and includes the kinetic energy as well as some 
potential energy. The zero point energy was evaluated by Wigner, who found that its 
contribution toward the ground state energy was proportional to rs 3/2 At large r, this term 
becomes smaller than the potential energy term, which falls off as r>'. He concluded the 
lattice was stable at sufficiently large values of r,. Of course the particles do not move 
independently. Their movements form collective vibrational modes. A better test of lattice 
stability is to calculate all the phonon modes and show that their frequencies are all real so the 
lattice is stable for collective motions. This calculation was done by Carr (1961). A summary 
of the theory of the Wigner lattice was given by Care and March (1975). 

In the limit of low density, as r, > oo, the potential energy of the Wigner lattice is 
proportional to —1.792/r,. This term is the total Coulomb energy, which is correlation plus 
exchange. If the exchange energy —0.9163/r, is subtracted from this result, there remains the 
low-density limit of the correlation energy: 


0.8757 
lim E, = — 


ro Fs 


(5.80) 


The numerator is rounded off to 0.88. Wigner also estimated that the high-density limit of the 
correlation energy was 


lim E, = —0.113 (5.81) 


r —>0 


His estimate is now known to be incorrect, because of the In r, term which causes a diver- 
gence in E, at small values of r,. The final formula (5.74) is a simple interpolation scheme 
which satisfies these two limits. This formula is plotted in Fig. 5.6 as the line marked W. The 
interpolation procedure assumes that the correlation energy is a smooth function between the 
limits of high and low density. The interpolation would be reasonable if the system were in 
the same state in both limits. But the two limits describe systems in different phases. The 
high-density system has free-electron states, and the low-density system has localized elec- 
trons. The smooth interpolation through a phase boundary is not obviously correct, nor 
incorrect. l 

Other interpolation schemes have been proposed. One by Nozieres and Pines (1958) is 


E, = —0.115 + 0.031 In(r,) (5.82) 
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CORRELATION ENERGY (RYDBERGS) 


FIGURE 5.6 The correlation energy of the electron gas. The line CM (Carr—Maradudin) is the same small r, 
expansion given in Fig. 5.4. The other curves are interpolation schemes according to Wigner (W), Nozieres and Pines 
(NP), and Lindgren and Rosen (LR). 


It is shown in Fig. 5.6 as the curve marked NP. It is quite similar to Wigner’s result in the 
range of metallic densities. Another interpolation formula has been proposed by Lindgren and 
Rosen (1970) (LR), 


0.0817! 
E, = -|r +3 +47- (5.83) 


S 


which is also shown in Fig. 5.6. It was chosen to agree with Carr and Maradudin at low values 
of r,. Although the LR formula does not have a In r, term, it does agree well with the exact 
high-density expansion at low values of r,. In the range of metallic densities, it has a lower (in 
magnitude) correlation energy than the previous two formulas. 

There have been other calculations of the correlation energies at metallic densities. These 
are usually based on (5.56). The quantity &pp,(q, im) is replaced by a better dielectric 
function—one which is more accurate at low densities. This approximation leads to an 
improved correlation energy. Even a direct evaluation of (5.56) gives a reasonable correlation 
energy for metallic densities (Lundqvist, 1969). There is no need to evaluate only the low r, 
expansion of this result, as we did before. One can numerically solve it for all values of r,. 
Some of these improved dielectric functions are discussed in Sec. 5.5. 
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5.3. METALLIC HYDROGEN 


It has not been possible to study the three-dimensional Wigner lattice in a laboratory. 
There is another system, which is superficially quite similar, for which one could consider 
attempting experiments. It is metallic hydrogen. If one takes the Wigner lattice and changes 
the sign of all the charges, one has the model for metallic hydrogen. The protons are well 
approximated as point charges which are embedded in a free-electron gas. Since all other 
monovalent atoms form metals, perhaps hydrogen may also be made metallic. 

Of course, at ordinary pressures, hydrogen does not form a metal. It is the gas H, at 
room temperatures. The lowering of temperature turns it into a liquid and then a solid. In both 
condensed phases the molecule H, retains its identity, so that the solid is a molecular crystal 
and an insulator. It may be possible to make hydrogen a metal by the application of pressure. 
As modern techniques achieve higher laboratory pressures, new phases of solid hydrogen 
continue to be discovered. However, they all seem to be molecular crystals and insulators 
(Mao and Hemley, 1994). 

The discussion of metallic hydrogen will adopt the approximation that the electrons are a 
uniform electron gas. Better calculations show that the electron density is slightly nonuni- 
form, since the electron density is higher near the proton than at the edge of the unit cell. 
Nevertheless, the assumption of uniformity makes only a small error in the analysis of the 
ground state energy. For a homogeneous electron gas, the electronic contribution to the 
ground state energy is the familiar result 


2.2099 0.9163 


rs rs 


E, = +E.(r,) + Ey (5.84) 
where Ey is the Hartree energy, and E,(7,) is one of the formulas for the correlation energy. 
The first three terms for the ground state energy are for the case where the positive charge is 
spread uniformly throughout the system. In metallic hydrogen, the positive charge is a regular 
lattice of points. The extra Coulomb energy which results from the localization of the positive 
charge was calculated above for the Wigner lattice. Changing the sign of all the charges yields 
the same result: Epy = —1.792/r,. The Hartree energy is the Coulomb energy calculated in 
the Hartree approximation, which neglects exchange and correlation. 

There are, however, some subtle aspects to this calculation worth mentioning. The 
interaction energy between the protons and the uniform electron gas is calculated as before. It 
includes the interactions among the protons, but omits the proton interacting with itself. The 
electron—electron interactions are evaluated assuming a constant charge density nọ. Consider 
the Wigner—Seitz model of a spherical unit cell. Although there is one unit of electron charge 
in each unit cell, it is not treated as a single electron, since the electron—electron interaction 
would then be an electron interacting with itself. Instead, each electron is spread uniformly 
throughout the solid. In the free-electron model, the unit cell has one charge because 107° 
electrons each contribute 107% of their charge. Then the calculation of electron—electron 
interactions has a negligible contribution from one electron interacting with itself. This 
picture is changed by the concept of the exchange and correlation hole. The electrons are not 
charges which are uniformly spread throughout the material. They are points which, on the 
average, may be found with uniform probability anywhere. But if one electron is at a point, 
the others are not, on the average, within its exchange and correlation hole. The radius of the 
exchange-correlation hole is similar to the radius of the Wigner—Seitz cell. The influence of 
the correlation hole is included in the correlation energy. Once it is included, the lattice energy 
from the protons is found by assuming the electron density is uniform. 
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The net result is that the energy from electron—electron interactions is small. In the 
Wigner-—Seitz spherical model, the total energy from electron—electron interactions is 


1.2 02 
Eo = E, +E, + = 37 
r 


S 


+E(r,) (5.85) 


Fs 


Ese is the sum of exchange, correlation, and the term 1.2/r, which comes from the uniform 
density of electrons interacting with themselves in the spherical unit cell. The term 0.28/r, is 
small and positive, while Æ, is small and negative. They largely cancel, so the net electron- 
electron energy is small. The concept of the exchange-correlation hole is correct. If one 
electron is put into a unit cell, then other electrons will not likely be there because of this hole. 
And if other electrons are not in the cell, there are no electron—electron interactions. Thus it is 
easy to understand why the net electron—electron interaction is small. 
The ground state energy for metallic hydrogen is 


2.2099 1 
E, = - — (0.916 + 1.792) + E.(r,) 
rs rs 
2.210 2.708 
= + E.r) (5.86) 
r r 


S S 


E,(r;) must be minimized, with respect to r,, to obtain the predicted density of metallic 
hydrogen; the predicted value of r, is defined as 7,9. The minimization will be done in the 
Hartree-Fock approximation by ignoring the correlation energy FE, because the variation of 
E.(r,) with respect to r, is small at metallic densities. The correlation energy will have little 
influence upon the choice of r. The simple equation to minimize is: 


dEr d (2.210 2.708 4.420 2.708 
0 = =- [22 = ——— + — 5.87 
dr, ral r? Fs ) rs rio l ) 
The minimum occurs at r, = 1.632, and the Hartree—Fock energy at this value is 
2.210 2.708 
Enr (r o) = 7 — = —0.830E,, (5.88) 
Fso Fso 


The Wigner correlation energy at this value of r,. is —0.093, so that the predicted ground 
state energy is —0.923E,,. It is the binding energy per electron. It is not even as large as 
atomic hydrogen, which is 1.00 Rydberg per electron. The prediction is that metallic 
hydrogen is not as bound as atomic hydrogen. Of course, it is possible to do a better 
calculation of £, for metallic hydrogen, which includes the nonuniformity of the electron gas. 
The best result so far is E, = —1.048E,, per atom at rọ = 1.60 by Hammerberg and Ashcroft 
(1974). Although this value is more bound than atomic hydrogen, it is still not as bound as 
molecular hydrogen or solid molecular hydrogen which has r, = 3.3. The conclusion is that 
hydrogen would rather be a molecule than a metal, and this conclusion is in accord with 
experiments. 


5.4. LINEAR SCREENING 


Screening is one of the most important concepts in many-body theory. Charges, which 
are able, will move in response to an electric field. This charge movement will stabilize into a 
new distribution of charge around the electric field. This new distribution is just the right 
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amount of charge to cancel the electric field at large distances. The proof of this is rather 
trivial. If the electric field is not canceled at large distances, more charge will still be attracted 
until it is sufficient for cancellation. If the electric field is caused by an impurity charge 
distribution p,(r), with net charge Q; = f d?rp,(r), the amount of mobile charge attracted to 
the surroundings is exactly —Q,. This fact was already used in discussing the Friedel sum rule 
in Sec. 4.1. The name screening charge is applied to the mobile charge attracted by the 
impurity electric field. It will also have its own distribution in space p,(r). The screened 
potential from the impurity charge and the screening charge is given by 


(rr) — jer pr’) + P(r’) (5.89) 


r= r'| 


This formula is an exact result, as long as p,(r) is found exactly. 

The screening charge is not necessarily in bound states due to the electric field from the 
impurity. Of course, that could happen if the electric field from the impurity is strong enough. 
But quite often the screening charge is from the unbound conduction electrons of the metal or 
semiconductor. In their motion through the crystal, they spend a little more time near the 
impurity potential, if it is attractive, than they do elsewhere in the solid. When these motions 
are averaged, there is more electron density near the impurity than elsewhere, which is the 
screening charge. If the impurity potential is repulsive for electrons, they tend to spend less 
time near the impurity, so the average charge is depleted near the impurity. Here the screening 
charge is positive, since it signifies a reduction in the average density of electrons, which have 
negative charge. 

The classical macroscopic theory is quite familiar. The electric field E and displacement 
field D obey the equations 


V-D(r) = 4zp,(r) (5.90) 
V-E(r) = 4n[p,7) + p,)] (5.91) 


All equations are Fourier-transformed to give 


iq * D(q) = 4np,(q) (5.92) 
iq > E(q) = 4n[p(q) + p,(q)] (5.93) 


The components of D(q) and E(q) along the direction q are the longitudinal fields D,(q) and 
E,(q). The longitudinal electric field is related to the scalar potential E,(r) = —V(r) or its 


transform (q) = iE,(q)/¢: 


Dq) = = p;(q) (5.94) 
E\(q) = a leva) +p] (5.95) 
$q) = =o) + pCa) (5.96) 


The dielectric response function is defined as the ratio D,(q)/E;(q) in the limit where p; — 0: 


oa DAG) _ ,. l p;(q) | 
e(a = wot Eq) — a p;(q) + p,(q) 6:27 
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In this limit e(q) becomes a property of the material and is independent of the charge 
distribution. One of our goals is to calculate this dielectric function. The linear screening 
model assumes this definition is true for nonzero p,(q), which gives for the potential 


o’( ) — 4r p;(q) 


TP ela OP) 
/ — dq 4r p;(q) ig-r 
o (r) = | On @ eq) ef (5.99) 


The potential @’ is the total potential from screening charge and impurity charge. That is 
obvious from its definition, since E(r) is the electric field in (5.90) from both screening and 
impurity charges. The potential p” should be similar to the exact screened potential o in 
(5.89). They are identical in the limit where p; is small. The linear screening approximation is 
to calculate (r) in place of o(r). It is a much easier function to calculate, at least after (q) 
has been determined. Another feature of the linear screening model is that the screening 
charge p,(q) density is proportional, in q space, to the impurity charge density p,(q). Linear 
screening models assume that 


p;(q) 
= —————— 0 
e p:(q) + p,(q) (5.100) 
p(q) 1 
pq) elq) l (5.101) 


are valid for finite values of p,, rather than infinitesimal ones. 

A simple example will illustrate when linear screening does not apply. Consider a point 
charge Z = —1 in the homogeneous electron gas with high density pọ. Here the screening 
charge is positive, which requires a depletion of the electron density around the impurity. This 
behavior is sketched in Fig. 5.7(a). Assume that the linear screening model works well for 


pir) j---—= 


NS 


LINEAR 


FIGURE 5.7 (Top) Change in electron charge density from linear screening (neglecting Friedel oscillations). 
(Bottom) A nonlinear screening model must be used for a much larger impurity charge. 
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Z = —1, although that may not be a good assumption in metals. Now consider the response to 
a large charge of Z = —10. In the linear response theory, the screening charge is multiplied 
10-fold, so that it predicts the electron density p(r) to have the dashed line in Fig. 5.7(b). Of 
course, this prediction is nonsense, since it has the electron particle density being negative for 
small values of r. The actual electron density has the form shown by the solid line. In this case 
the screening charge is not just 10 times that for Z = —1. 

The macroscopic theory defined the dielectric function ¢(q). But it did not provide a clue 
to the technique of finding it. That is the role of a microscopic theory. The immediate goal is 
to derive a rigorous definition of (q) in terms of microscopic operators. In the following 
section it is shown how this exact equation is solved, approximately, to give model dielectric 
functions which are used in calculations. 

The derivation of the exact equation for e(q) starts by considering the interaction 
between two impurity charges Z,e and Z,e in the homogeneous electron gas. The interaction 
potential between these two charges, in the linear screening model, is proportional to the 
product Z,Z,. The ground state energy of the system is evaluated, and all energy terms are 
extracted which are proportional to Z,Z,. The summation of these terms is defined as the 
interaction potential between the two charges. Of course, there will also be terms proportional 
to Z? or Z (n =2,3,...), which are the energies needed to put each separate charge in by 
itself. The terms Z/Z5' for m,n > 2 are contributions to the nonlinear interaction. All other 
terms are ignored except Z,Z,, since the immediate interest is the derivation of linear 
screening. 

The Hamiltonian of the homogeneous electron gas, with two impurity charges Z,e and 
Ze at R,; and R,, is written as 


Ze 1 2 
H = Hy +" -- © 0, p(q) 0 Ze ® (5.102) 
j=l 


IR; — R| Vazo 
pq) = Chg ope (5.103) 
po 


The first term Hy is the Hamiltonian for the homogeneous electron gas (5.1). It includes the 
electron—electron interactions, as well as the kinetic energy. The term e*Z, Z, /|R, — R,| is the 
direct interaction between the two charges. The last term in (5.102) is the interaction potential 
between each impurity charge Z,e and the electrons of the homogeneous electron gas. They 
are represented by their density operator p(q). 

The ground state energy is calculated from the thermodynamic potential, which is found 
from the linked cluster theorems of (3.264)-(3.266): 


1& 
O=% -g Ui 


D p’ TE 
Ui = ] | dt, use | dt(T, V(t) st V (1) different connected 


where the U, are the different connected diagrams. Only the terms in this series will be 
evaluated which are proportional to Z,Z,. The noninteracting thermodynamic potential Qg 
comes from Hy. At zero temperature, Qg is the ground state energy of the homogeneous 
electron gas, which was evaluated in Secs. 5.1 and 5.2. The last two terms in (5.102) are the 
interaction potential V which enters the perturbation expansion for the thermodynamic 
potential. 


320 Chap.5 e Homogeneous Electron Gas 


The term U, in the expansion has only one power of V. In this order, the only contri- 
bution proportional to Z,Z, is from the direct interaction 


ZZ 
U = -p-m = -p22 | “4 


RR z ved RiR) (5.104) 


(27) 


The other first-order term is zero, since the average of p(q) is zero in the electron gas unless 
q = 0. Of course, these averages are taken with the Hamiltonian Họ, which is without the 
impurities present. 

The term U, has two powers of V. Two powers of the last term in (5.102) give a 
contribution 


l p , 
U, = 2y Ar,2 f dT | dT » Vglq (TPG, T, )p(q ’ T2)) 
qq’ 
x (Z ét R 4 Zet RZ, et Ri + Zet R) (5.105) 


This term is the only one in U,; the other perturbation e*Z,Z,/|R, — R,| does not enter again 
since it has only the one connected diagram. In U, there is a term proportional to Z? and 
another proportional to Z2. They are part of the energy needed to put each charge, separately, 
into the homogeneous electron gas. It is not the total energy, since there are terms in higher 
orders of perturbation theory, proportional to Z’ (n > 3), which also contribute. The term U, 
also has a cross term, which is proportional to Z,Z, 


Z2 iq’ R,+iq'-R P p / 
pew” 2 VgUyi at , TaT. plA 71)P(G T2)) 


U, = V 
qq’ 


There are no other terms proportional to Z,Z,. The higher linked cluster terms U, have only 
higher powers of the charges. The derivation is completed. The net interaction between the 
two charges is the sum of the terms from U, and U,. The U, term is simplified by the fact that 
in a homogeneous system it is nonzero only when q+ q’ = 0, since the density—density 
correlation function of the electron operators is nonzero only in this circumstance: 


dq ia- (R,—R 
AQ = ZZ rae" ( l 2) (5.106) 
T 
B 
a apf dr | dty(T.p(q, t1)p(—4, “| (5.107) 


This formula is compared with the linear screening model (5.99) for the potential from a 
charge distribution. One charge, say Z,, is the impurity (p;(q) = Z,). The other charge Z, is 
the test charge which measures the strength of the screened potential. The net interaction 
between the charges can be written as a screened coulomb interaction of the form 


dq Vg 
V(R, —R,) =AQ=Z,Z e't (Ri —Ry) 5.108 
(R, —R;) a | oa! (5.108) 


which provides a rigorous definition of the dielectric function: 


P 
— =1-4 7 
e(a) l — wf az, | dt {T,p(q, )P(—4, 12)) (5.109) 
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The correlation term can be further simplified. The correlation function depends only on the 
difference of the two arguments tT, — Tt». This fact, along with the periodicity (3.17) of the 
argument, permits one of the t integrals to be eliminated to give 


B B B 
| at, | dt,(T,p(q, T,)P(—q, T2)) = p| dt(T.p(q, t)p(—q, 0)) 


so the inverse dielectric function is 


1l, uf _ 
Zo! a| aeiT.o(a, pl q, 0)) (5.110) 


The exact result (5.110) is very important. It relates the dielectric function to the density— 
density correlation function. The time variation of the operators p(q, t) = e“”p(q)e"“ is 
governed by Ho which is the full Hamiltonian for the homogeneous electron gas but without 
the potential of the impurities. 

The static density—density correlation function (p(q)p(—q)) is related to the static 
structure factor S(q), which was defined in Sec. 1.6. For a system of N, electrons 


= (PDPA) = Nô + S(O) (5.111) 


e 


The pair distribution function g(r) of the electron gas can be obtained from a knowledge of 
S(q). The latter quantity is important in describing correlation in the electron gas. This 
observation raises the question of whether there is any relationship between S(q) and 1/e(q). 
In fact, yes. In order to show this relationship, generalize (5.110) to nonzero values of 
frequency: 


l 
E(q,i®,) 


po 
L=% dre iT pla 2)P(—4. 0) (5.112) 
0 


This result is true, although it has not been proved here. Actually, it could be regarded as the 
definition of the longitudinal dielectric function ¢(q, ia,,). 
Equation (5.112) is used to prove the following important theorem: 


1 (© do 1 I 
N38... +S(q) = -—— aw Im! ———— 5.113 
eðq=0 + S(Q) NoVy im 2n 1 — e-o za z om 


The function e(q, œ) is the retarded function obtained from e(q, iw,,) by io, —> œ + id. The 
subscript “ret” will be omitted, since the retarded function is always intended when writing 
e(q, ©). The factor no = k3/3m? is the particle density, and v, = 4ne*/q’. 

To prove this theorem, again introduce the sets of states |m) and |n), which are exact 
eigenstates of the Hamiltonian. These states were used in Sec. 3.3 for proving several 
theorems. The Hamiltonian of the homogeneous electron gas has Hp|n) = E,|n). These 
complete sets of states are introduced into (5.112): 


1 v, (P 
——__=1--1] day eE 2 io+E, Ep) 
e(q, io) V | T $ et" (alem) e 
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The t-integral can be done and the retarded function is found in a formal manner: 


(5.114) 
1 Vv ep EnEn) — 1 
— 1 — a —BE,, 2 a 
e(q, ©) ve [aletim © + E, — En +16 


The quantity on the right is real except for the factor of id. The imaginary part of this term just 
gives a delta function: 


in| : | = Tay eE al (n|p(q)|mn) Perm) — 1]8(@ + Ey — En) 


€(q, ©) Vim 
(5.115) 
Im] ——] = —(1 = eto) ELS eE (alpa) 28(@ + Ey — En) 
e(q, o) V nm n m 
Divide by (1 — e~®) and integrate over all œ: 
° do l 1 v 
— — J —— | = —4 —BE,, 2 
I nm 1—e-fo m| 5 y 2e |(alpla)lm)| 
v 
= —~ (p(a)p(—9)) (5.116) 


The term on the far right is the static density—density correlation function. It gives the static 
structure factor S(q), as shown in (5.111). This completes the proof of (5.113). At zero 
temperature, the above formula becomes (q Æ 0) 


si =~ [im | (5.117) 


NoVg Jo T e(q, œ) 


which is the way it is often written. The pair distribution function g(r) or the static structure 
factor S(q) is obtained from a knowledge of the frequency-dependent dielectric function 
e(q, œ). The latter formula is not dependent on any assumptions regarding linear screening. It 
is exact. It arises because both S(q) and (q, œ) are related to the density—density correlation 
function. The assumption of linear screening is merely using (5.99) to calculate the screened 
potential from the impurity charge distribution p;(q). 

The density—density correlation function 


| P l 
— | dre" (T.p(q, pq, 0)) (5.118) 


has the appearance of a Green’s function in the Matsubara representation. There is an operator 
o(—q, 0) acting at t = 0 and its inverse at t. Since the operator is the density, the Green’s 
function provides the response of the system to a density fluctuation. To develop the analogy 
further, the function 


1 l 

S(q, ©) = -m| i] (5.119) 
NoVy e(q, @) 

is the spectral function of this operator, since it is proportional to the imaginary part of the 

retarded Green’s functions associated with this correlation function. This observation is 

important, since the spectral functions provide direct physical information. For the electron or 

the phonon, peaks in their spectral functions are interpreted as excitations of these operators. 
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In a similar way, the peaks in S(q, œ) are interpreted as longitudinal excitations of the electron 
gas. These are two-particle excitations, since the density operator itself contains two opera- 
tors—one creation and one destruction. In fact, some of the excitations, such as plasmons, are 
collective excitations of many particles. 

The density operator has boson properties, since it is the product of two fermion 
operators. So S(q, @) is a spectral function for boson operators. Consequently, it has many 
features in common with other spectral functions for bosons; i.e., compare (5.113) with the 
similar phonon result (3.136): 


OQO 


do 
2N +1 = | zy Ma(@) BC, o) 


—oo 


Another feature of S(q, œ) is that it is positive for œ > 0 and negative for œ < 0, with 
S(q, —®) = —S(q, œ). This identity may be shown directly from (5.115). 

The dielectric function has been defined in terms of the interaction between two fixed 
impurity charges. The assumption has been that the impurity charges are classical objects. 
There remains the question of the effective interaction between two electrons, which are 
surely not classical objects. The present theory suggests that there is an additional factor in the 
effective interaction between two electrons, which is a vertex correction 


W(q) = GTO (5.120) 


The main contribution to the vertex correction I (q) arises from the ladder diagrams at the end 
points of the interaction. The method of evaluating this vertex correction is discussed in later 
sections of this chapter. 


5.5. MODEL DIELECTRIC FUNCTIONS 


No one has yet derived the exact dielectric function of the homogeneous electron gas. 
Instead, approximate solutions have been obtained to (5.110). Some of these have been very 
successful, perhaps because they are simple or perhaps because they are accurate. They 
acquire the name of their inventor(s). Four of them are described here, which is only a small 
subset of the vast number which are available. 


5.5.1. Thomas—Fermi 


The Thomas (1927)—Fermi (1928) theory is also called the Fermi-Thomas theory about 
half the time. The derivation begins with the exact equation for the screened potential energy 
from an impurity charge distribution p,(r), 


V*V(r) = 4ne[p,(r) + p,(0)] (5.121) 


where p,(r) is the screening charge. The symbol p is used for charge density, and n(r) for 
particle density (p = —en), and p(q) for the density operator. This equation can be obtained 
from (5.89) by the application of —eV’. The factor —e arises because V(r) is the potential 
energy for electrons. Now start to make approximations. In Thomas—Fermi theory, the 
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electron density n(r) is represented locally as a free-particle system. Write the screening 
charge as the difference between n(r) and the equilibrium charge density no, 


P(r) = —e[n(r) — no] (5.122) 


where the electron charge is —e. For a free-particle system, the local density is n(r) = 
k3(r)/3n*, where the Fermi wave vector is now a local quantity. It, in turn, is determined by 
the condition that the chemical potential ų is independent of position: 

k2 

ke) = E-(r) = u — V(r) (5.123) 

2m 

The physics is illustrated in Fig. 5.8. Assume the potential V(r) is slowly varying in space. 
Draw a little box in each region of space as the shaded region and treat it as a gas of fermions. 
If the absolute Fermi level is at u, then the effective Fermi level is reduced or raised by the 
value of the local potential V. If these approximations are collected, there results the equation 


VV = ielo + eng — en | — d (5.124) 


For atoms, this approximate equation is solved exactly with Ep = 0 and p; = Z 5°(r) to give a 
good description of atomic potentials and charge distributions (Landau and Lifshitz, 1958). 
The assumption that V(r) is slowly varying does not seem unduly restrictive. To get a linear 
screening model, and hence a dielectric function, a further assumption is needed. It is 
assumed that V/E; < 1, so the root is expanded (1 — V /Ery! * ~ | — 3V/2E; to obtain the 
equation 


6 2 
VV = 4nep,(r) + "a V(r) (5.125) 
F 


The term on the right, proportional to V, is now moved to the left. Its coefficient is defined as 
the square of the Thomas—Fermi screening wave vector qrr 


(V? — ip) V(r) = 4nep,(r) (5.126) 
2 
Pe = ren (5.127) 


This equation may be solved in Fourier transform space to give 


3 
d q p;(q) eat 


V(r) = Are | (5.128) 


(2n)° Q? + Ur 


FIGURE 5.8 The Thomas—Fermi model has a Fermi degeneracy energy Er which varies with position according to 
the variation in the potential energy. 
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Compare this equation with (5.98), and conclude that the Thomas—Fermi dielectric function is 
q2 
elq)= 1 + (5.129) 
It has a simple form, which makes it easy to use in a variety of calculations. That is the chief 
explanation for its popularity. 


For example, an analytical result can be obtained when the impurity is a point charge 
0,(q) = Q;. The integrals to evaluate are (v = cos 8) 


[° @dq f! l 
y(n) =-S 1 | dye (5.130) 
T Jo O° + qir J-1 
OQ 
| d 
=% | 11 sin(qr) (5.131) 
nr Jo 9° + UK 
foo d iqr 
=-&] ciclo (5.132) 
INF J_o9 Q* + tr 
V(r) = — E&i emare (5.133) 


The last integral is done by closing the integration contour in the upper half plane and taking 
the residue of the pole at iqrp. The screened interaction has the form of a Yukawa potential. 
The interaction declines rapidly at large distances because of the exponential dependence 
exp(—qz,rr). In metals, the Thomas—Fermi wave vector has a typical value of 1 A~!. The 
screened Coulomb potential declines rapidly on the scale of a unit cell. The screening wave 
vector may be expressed in atomic units as 


1/2 
1.5632 
= | -k = 5.134 
Ao TF ( pao) JF ( ) 
For example, at sodium density r, = 3.96, one finds qrp = 1.48 A7!. 
Thomas—Fermi theory provides only a static model for e(q). It is not usually used to 
describe the dynamic response €(q, œ). 


5.5.2. Lindhard, or RPA 


The Lindhard (1954) dielectric function is more commonly called the RPA, for random 
phase approximation. It is a model for a static e(q) or dynamic ¢€(q, œ) dielectric function. It 
was already introduced in Sec. 5.1 for the discussion of correlation energies. It is rather easy 
to derive and has a simple conceptual basis. It also predicts correctly a number of properties 
of the electron gas such as plasmons. In the early days of electron gas theory, it was the 
dielectric function. Nowadays there is a tendency to use one of the recent models, which are 
better for describing the response of the electron gas. Two derivations of the RPA will be 
presented: one from equations of motion and the second using Green’s functions and 
diagrams. 

The derivation by equations of motion is also called the method of self-consistent field 
(Ehrenreich and Cohen, 1959). One introduces an impurity charge density p,(r, t) or its 
equivalent Fourier transform p,(q,@). The equivalent impurity potential is V,(q,@) = 
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—eQ,(q, œ). There are three potential energies in the problem. The first is from the impurity 
V(q, œ), the second is from the screening charge V,(q, œ), and the third is the total potential, 
which is the sum of these two: 


V(q, ©) = V{(q, œ) + V,(q, ©) (5.135) 
VVE, 1) = 4nep,(t, 1), or Vala, 0) =~ pala 0) (5.136) 
7 4ne 
V°V Ar, t) = 4nep,(r, t), or Vi(q, ©) = — 7 p;(q, ©) (5.137) 


The major assumption in the derivation is that the electrons respond to the total energy V. 
When solving the equations of motion for the electron, use the potential V(r,t) or its 
transform V(q, œ). This choice presents a minor problem, since initially V (q, œ) is not 
known. That is the object of the calculation, and V;(q, œ) is assumed known. Once V (q, œ) is 
known, the dielectric function in the linear screening model is 


_ Vila, ®) 
V(q, ©) 
and the calculation is completed. In the method of self-consistent field, it is assumed that the 


electrons respond to V; then try to determine this function self-consistently. Write the 
effective Hamiltonian for the electrons as 


€(q, ©) (5.138) 


] 
H = X eCpoCpa +7} Va, PC) (5.139) 
po q 


The time dependence V (q, t) is put directly into the Hamiltonian (5.139). The impurity charge 
is regarded as a classical system which is oscillating. The goal is to find the quantum response 
of the electron gas to this classical oscillation. Furthermore, the impurity is assumed to 
oscillate at a single frequency: p,(r, t) = p,(r)e™ and V,(r, t) = V,(r)e""’. The average 
response of the system will depend on œ, so write the average of p(q, +t) as (p(q,7)) = 
o(q, we and the average of V(q, t) as (V(q, t)) = V(q, we. For the homogeneous 
electron gas, the density operator p(q) has an expectation value of zero for q 4 0. When the 
impurity is present, the expectation value is nonzero and is proportional to the average for the 
screening charge: 


(p,(q, 2) = —e(p(q, #)) = —e E (Cira, Coo) 
po 


= —ep(q, we” (5.140) 
Since the averages for p, and p are proportional, it simplifies the discussion to use only one 
symbol, which we choose to be p. For example, in terms of the average (p,) = —e(p), Eq. 
(5.136) is 
4re? 
V(q, ®) = PA p(q, ©) (5.141) 


The dielectric response function is defined as the ratio (5.138). The first term in (5.139) is the 
kinetic energy of the electrons, and the second is their interaction with the self-consistent 
potential V(q, t). Note that there are no explicit electron—electron interactions. They are 
included, indirectly, in the interaction term: the part of V (q, £) from the screening V,(q, t) is 
caused by electron—electron interactions. It is a rather crude way to include these interactions, 
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since it includes screening but neglects all other effects of correlation and exchange. In later 
sections, other dielectric functions are introduced which are more accurate. 

To obtain the screened potential V, in (5.136), an expression is derived for the screening 
particle density p,(q, ¢). It is obtained by writing an equation of motion for this operator and 
then solving it approximately. A perturbation on the system of (q, œ) will cause polarization 
of the electron system, so that the average (p(q, ^) will now have a nonzero value. In the 
linear screening model, this average is assumed to be proportional to the potential causing the 
perturbation, so (p(q, t)) « (V(q, t)). The goal is to determine the constant of proportionality. 

The equation of motion of the density operator comes from 


d 
iela) = [H, pla, D (5.142) 
Actually, it is more convenient to evaluate the equation of motion for the operator: 
d 
i= Co+aoCpo = [Hs Chea oCpol (5.143) 


After it has been evaluated, the result is summed over (po) to obtain p(q, £). The impurity 
potential V(q, t) is assumed to be oscillating at a single frequency exp(—iœt) so that the time 
derivative on the left of (5.102) gives —iaC} +q,0Cpo- The commutators on the right are 
evaluated for the Hamiltonian (5.139), 


3 EklCE, Cis» Cp+q,oCpo] = (Eptq — Ep) Chig.cCpo (5.144) 
l Arct i 
v, VCA, HlCk+g,s Oks Cp+q,o Cpo] 
=- 2 VQ‘ A Cptary,o Ceo — Chao Cp-qo) (5.145) 
which gives the equation 
(Ep — Eq+q + o)Cp p+q.oCps = 2 Va‘, ACÀ p+q+q’.o — Ch+q.0Cp-q.c) 
~ Eva, (CfoCpo — Cp+a,oCp+a,o) (5.146) 


The last term on the right is approximated by taking only the term in the summation which 
has q’ = —q. The terms with other values of q’ are neglected. It is assumed they average out 
to zero. They do not average to zero, but that is what is meant by the random phase 
approximation. The approximate equation can now be solved: 


t t 
Cpo = D ( See’ po = pras prao (5.147) 
i V Ep — Eqig + O 

The above equation is now summed over (po) to give 


_ V(q, t) re Ch oCpo — Chia, o Cp+q,0 


= 
p(q )= XG ptqa’ y co Ep — Eqig +O 


This equation shows that the operator on the left, p(q, £), is proportional to the operator on the 
right, V(q, t). The average is taken of this equation, so that (p) and (V) are replaced by 
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o(q, we and V(q, me. In addition, the number operators ChoCoo and Cr +q,0Cpta.o 
are replaced by their averages n-(&,) and nr(&,+q)- These steps give an equation which 


relates the average of these operators: 


V(q, o) np (Ép) — nr (Sp+q) 


,0) = = V(q, PP (q, © 5.148 
p(q, œ) x 2 E (q, œ)P* (q, @) (5.148) 
which can now be used in (5.141): 
Ane 
V.(q, 0) = 5 PCG, ©) = V(G, oo Pa ©) (5.149) 


The result has the screening particle density p(q, œ) proportional to the self-consistent 

potential V(q, œ). The constant of proportionality is Pq, œ), which is evaluated below. 
The equations may now be solved to obtain the dielectric function. In the equation for 

V(q, ©), substitute the new result for V,(q, œ) and then solve for V(q, œ) in terms of V;(q, œ): 


V(q, ©) = V(q, ©) + V,(q, ©) = Vq, ©) + vP” Y (q, œ) 
Vi(q, ©) 


= — 5.150 
ToP o) (5-150) 
The ratio of these two quantities is just the RPA dielectric function: 
Erra (q, ®) = 1 — v,P(q, @) (5.151) 
n —n 
PY (q, o) = 1 ys ep) — nr Cpr) (5.152) 


The above result completes the derivation using self-consistent fields. 
The second method of deriving Epp, is a diagrammatic analysis using Green’s functions. 
The basic definition of 1/e(q, œ) in (5.112) is rewritten in the interaction representation: 


BO $(B)6 A (_ 
L —]_”% | rein (ES(B)P(G, DP(—4, 0)) (5.153) 
€(q, iœ) v Jo (S(B)) 
l 
V=- E v Chra o Clq oCko Cpo (5.154) 


V pkq oo’ 


where H, is the kinetic energy term in the homogeneous electron gas and V is the electron- 
electron interactions. The operator p has its time dependence determined by Hp in the 
interaction representation. The S matrix will be expanded term by term to see what sort of 
terms develop. One particular subset of these terms will be summed and will yield the RPA. 
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The first term in the expansion for the S matrix is just P“(q, io): 


; l P iO, T ^A A 
P(g, ia) = — at dte'" (T PCa, t)6(—q, 0)) (5.155) 


l p iw, ~ > ~ ~ 
= — V 2 2 | dte’ n (T:Cpo(T)Ck-q,0' (0)Cko (0)Cko (0)) 


Iaf, 
=E | are 9%, 0D +q 0 (5.156) 
p 


_ 1 np (6p) — np (6p + q) 
=o Ste (5.157) 


The Feynman diagram is shown as a single-fermion closed loop in Fig. 5.9. The wiggly lines 
at each end are just added to define the two vertices of the polarization diagram. They could 
indicate that the polarization term is in response to an excitation with wave vector q and 
frequency iœ, The calculation for Epp, is not terminated at this point, since the terms so far 
are 1/e = 1+ v,P) + -.- rather than gp, = 1 — vP. Obviously, more terms are needed 
to get RPA. 

The next term in the S-matrix expansion is 


1 B l B 
-5 z| are | AT Vy 
0 


V pkq! oo’ 0 
x (TOG, Diy g(t Ci yo (TI Cko(TCpo(t)0(—4,0)) (5-158) 


There are four terms which result when Wick’s theorem is applied to this correlation function. 
All contributions have four electron Green’s functions and one Coulomb interaction vy. Their 
diagrams are shown in Fig. 5.10. The first one is a vertex correction to the basic bubble 
diagram. The next two are exchange energy diagrams for the Green’s functions in the bubble; 
they contribute to the self-energy of these Green’s functions. The last diagram contains two 
bubbles which are connected by the Coulomb line v,. 


(G, tw) 


(Góu) 
FIGURE 5.9 


FIGURE 5.10 
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Earlier it was remarked that the density—density correlation function had the appearance 
of a Green’s function. It also has a Dyson equation. The exact evaluation of this correlation 
function may be written as 


1 P(q, i 
=< | dee" T plg Doa 0)) = Ea (5.159) 


— v,P(q, ©) 


Here the density operators p(q, t) have the t dependence governed by H = Ho + V, instead 
of only Hy as in P“). The polarization diagram P(q, iœ) is the summation of all “different” 
polarization terms. Polarization diagrams are not “different” if any of their parts are linked by 
a single Coulomb line v,. For example, the last diagram in Fig. 5.10 is not a different 
polarization diagram. This term arises from the expansion 


PY + Pl) 4... 


= (pu +... (1). yn... 
i o(PO 4p.) +o +0 (Pi +e) +++) 


— pl) 4 v(PYY doo. (5.160) 


where it is the term v,(P“)’. There are terms in P(q, im) which have more than one bubble, 
but they must be connected by more than one Coulomb line. 

The random phase approximation is approximating the exact polarization diagram 
P(q, iœ) by its first term, which is P“)(q, iœ). The RPA is found from the summation of all 
single bubble polarization diagrams 


Igp PE lO (5.161) 
erea 1o, PO 1-0, PO 


which does give Epp, = 1 — v,P™ (q, im). This derivation makes clear the approximate 
nature of the RPA. The total polarization operator P(q, iœ) has an infinite number of terms, 
while the RPA retains one. The exact dielectric function is easily shown to be 


€(q, iœ) = 1 — v,P(q, iœ) (5.162) 


An obvious way to improve the dielectric function is to include more terms in the summation 
of polarization contributions (Geldart and Taylor, 1970). This step is not as simple as it 
sounds. There are an infinite number of possibilities, so some physics must be used to guide 
the choice. Neither does it help that the obvious possibilities, such as the first diagram in Fig. 
5.10, are not simple to evaluate analytically. In fact, most progress has been made by 
nondiagrammatic means, as will be discussed below. 

The RPA dielectric function is evaluated once P“)(q, im) is obtained. The retarded 
function is obtained by taking the analytical continuation iœ —> œ + ið. The retarded 
dielectric function is complex, and its real and imaginary parts are called €, and &,: 


Erpa (q, ©) = £ (q, œ) + ie2(q, œ) (5.163) 
2 2 
JTF m 244, |E + Ur +O 
0) = 14+ 72114" ME, — Info Pe” 
m? E, + qvp —@ 
—_|4E,¢ — (£, — oY] In -4 l 
+ kng? [4E pe, — (€, — ©) ] In e, qir O (5.164) 
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o For q < 2kp 
22 
E5(q, o) =20(S ) qvr —&> 0>0 
em?’ m\* 5.165 
= (Z5) e- (2) (w — £) , Eg qVF > O > Ur — Ez ( ) 
— 0 O > &, + qVF 
e For q > 2kp 


em? ? 
€o(q, ©) = (Z) c — (=) (œ — J Eg t qVF > O > Gur — Eg 


The real part €; (q, œ) may be represented by a single formula. The imaginary part €,(q, œ) 
has a variety of functional relations for different values of (q, œ). 

Figure 5.11 shows graphs for €,,&, and —Im(1/e) for r, =3 and all relevant 
frequencies. For large values of q, say q > 2kp, one finds that €; ~ 1, €, is small, so that 
€, ~ —Im(1/e). There is only a single line drawn for these cases. 


W/E r 


many, 


€ Im(1/€) 
Spare 


m4 2 3 2 4 6 8 
w/Ep w/Ep 


FIGURE 5.11 The RPA predictions regarding €,, £2, Im(1/£) as a function of q and œ. Results shown for r, = 3.0. 
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The real part ¢, always approaches 1 at large œ. At œ — 0 then £, — 0, and the static 
Eppa is just €,(q, 0). Using the notation (x = q/2kp), 


1+x 
1—x 


2 
{TF l 
0=1 +4] +>- 


| (5.166) 


which is always positive. The prefactor to the bracket includes the square of the Thomas- 
Fermi wave vector qrp. For values of q < kp, then £; (q, œ) becomes negative for intermediate 
values of œ. This requires two crossings of the £; = 0 axis. The low-frequency crossing 
always happens when ¢, is large, so that —Im(1/s) = £,/(£? + &) is well behaved when 
€; + 0. However, the high-frequency point where e; =0 has ¢,=0. In that case 
— lim(1/e) = ,/(e7 + £ż) = 76(€,), so that a delta function is obtained. This delta function 
is the plasmon peak which is the sharp singularity on the right of the graph. It is given a 
nonzero width in Fig. 5.11 to aid the eye. Remember that peaks in S(q, œ) are interpreted as 
excitations of the system 


1 l | E, 
S(q, ©) = — — Im = — 5.167 
“a ) Novag En 5 Nog Ef + 85 l ) 
Plasmons are excitations which exist in real metals and in any electron gas. They were 
discovered in ionized gases by Langmuir. They occur at small values of q, as is evident in Fig. 
5.11. 
Examine the limit of (5.164) when q — 0 while œ remains nonzero. The first step is to 
expand P®(q, œ) while assuming £, & @, qur X œ. Let A = £p+q — Ep and assume that 
A < o so that 


2 1 1 
P(g, 0) == o 5.168 
(4, 0) 25 alo ae =| (5.168) 
2 1 l 4 A A 
= — } nr(&)| — yj -n ~ Ermey eae 
vO S ô 142 vWF © O 
O O 
2naE l 


This result is inserted into € = 1 — v,P™ which derives the high frequency limit of the 
dielectric function 


o2 1 1 
. — P 3 2 2 
lim €,(q,@) = 1 — = lı +z [2 (qvr) + s4] + (5) (5.170) 


» 4neno 
2 _ 0 


wo (5.171) 


m 


For £, = 0 the condition that €, = 0 predicts that the plasmon peak occurs at the frequency 


3G’ Up 
o=o,{1+——-+.--- 5.172 

p ( + 10 o2 + l ) 
At q=0 the prediction is quite simple, œ = @,. The quantity œ, is called the plasma 
frequency of the electron gas. It depends only on the electron density ng and mass m, which is 
actually the effective band mass, although these are close to the free-electron value for many 
metals. Table 5.1 shows some actual plasma frequencies measured in metals, which are 
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TABLE 5.1 Plasma frequencies in eV. The values for hw, are 
corrected for the dielectric screening of the core electrons 


Metal Experimental ho, ho, Reference 

Li 7.12 8.03 7.95 a 

Na 5.71 5.90 5.77 a 
5.85 b 

K 3.72 4.36 4.02 a 
3.87 c 

Mg 10.6 10.88 10.70 d 

Al 15.3 15.77 15.55 e 


aC, Kunz, Phys. Lett. 15, 312 (1965). 

bJ, B. Swan, Phys. Rev. 135, A1467 (1964). 

“J. L. Robins and F. E. Best, Proc. Phys. Soc. London 79, 110 (1962). 
dC, J. Powell and J. B. Swan, Phys. Rev. 116, 81 (1959). 

€C. J. Powell and J. B. Swan, Phys. Rev. 115, 869 (1959). 


compared with this simple formula. The predictions for œ, are found to be amazingly 
accurate. One great reason for the popularity of the RPA dielectric function is that it obviously 
describes plasmons very well. Later it is shown that it is exact in the limit where q —> 0, so 
that this success is understandable. 

The formulas for €,(q,@) in (5.165) are complicated. They may be derived in the 


following fashion. First, begin with the definition of the imaginary part of the retarded 
function: 


e(q, ©) = —v, Im[P2)(q, o) 
dp 
= 2nv, es S(Ep — Eptq + OMe lp) — nr (Eprg) (5.173) 


A variable change is made in the term with n F(Sp+q) by replacing p > —p — q, so the above 
equation becomes 


U 
e2(q, ©) = On | Pane OC, — Epig + ©) — ÒlEp — Ep+q — ©)] 
k 1 
a| Pa| l qpVv ( qpV | 
— 4 d ` ol- w 
On P Ip Eka (e, + „TO Ôl E + a +0) (5.174) 
This formula shows that €,(q, œ) is antisymmetric in frequency €,(q, œ) = —,(q, —œ). It 


will be evaluated first for œ > 0. The angular integrations (v = cos 9) are done first, and they 
eliminate the delta function: 


l 
qpvV m 
dvd — +0} =—Op — 
E V (e + m o) pq (p P12) (5.175) 


m 
Pi? = 7 lO = eal (5.176) 


2me { [** kr 
e&:(q, ©) = 7 (| pap — | pap) (5.177) 


Pi P2 


2 
£(q, ©) = or lk: — py (ke — p?) — (kp — pakk — p3) (5.178) 
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The complicated aspect of the integral comes from the lower limits of integration p, and p>. 
These limits are imposed by the angular integral over the delta function. For example, in the 
first integral the delta function forces v = cos @ to equal 


—] <v=(o-£,)— < l (5.179) 
Pq 
and v must have values between +1. The latter conditions restrict the value of p to 
m 
-p < — (0 — £&) <p (5.180) 
q 


m 
p> |@— el = Pi (5.181) 


The integral contributes only when k? > p° > p%. In the second integral, the result is similar 
and only the sign of œ is changed. The result in (5.178) is identical with (5.165), although 
there the result is simplified. For example, kp > p, may be satisfied only when q < 2k;, so 
the second term may be eliminated when q > 2kp. 

At small values of q, €5(q, œ) is proportional to œ at small values of œ. This dependence 
is shown in (5.165) and is evident from the graphs in Fig. 5.11. The proportionality to œ is an 
important feature of €,. It arises in (5.178) whenever both inequalities kp > pı, p are 
satisfied, since p? — pł = 2mo. The linear dependence of e, on œ must also occur for the 
exact dielectric function, which may be shown by a simple argument. The physical process 
under consideration is the rate at which electron—hole pairs are made in the electron gas. A 
hole is a state which has an electron removed from the filled Fermi sea. An initial electron of 
momentum p and energy §, is excited by a perturbation with (q, œ). The electron is excited to 
a new state with momentum p + q and energy 14, = 6q +. The electron can only be 
scattered into states which are previously unoccupied, so §,,, must be above the occupied 
Fermi sea. The basic process takes an electron from below to above the Fermi level. It leaves a 
vacancy in the Fermi sea, which is the hole. The excitation process makes electron-hole pairs. 
The net rate of making such pairs is 


3 
Roairs = 4n ies O(€, 7 Ep+q + o){nr (6p) [1 — Nr(Sp+q)! _ Nr(Spiq Ll! — np (6p)l} (5.182) 


where np(Ep)[1 — nF (Čp+q)] is the rate of making pairs, while nr(Sp+q)[1 — nr(S))] is the 
return rate. A formula similar to (5.173) is derived, except that neither &, nor np(6,) need to 
be their free-particle values. They could be interpreted as the energy and occupation number 
for the fully interacting system. Now change integration variables to pdp = md&, = mdé: and 
pdv = (m/q)dbp+q = (m /q)d&', the above integral may be written as 


2 
Rais © | dé |as +o —€)ing(®) — nE) (5.183) 


2 
~ T | deine) ~ npl& +0) (5.184) 
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where the limits of integration are ignored in this simple argument. The remaining integral is 
over a distribution np(&) minus the same integral over the displaced distribution n-(€ + ©). 
The difference between these two must be proportional to œ, at small œ, since 


nr +0) ~ n) + o TE (5.185) 
7 om? dny(S)| _ om? 
Rpairs ~ om fae - ae) = nq (5.186) 


The rate of making electron-hole pairs is proportional to œ. The derivation applies only to 
small values of w, since at large values the limits of integration change the result as they did 
in the case of the RPA. The area in (q, œ) space over which €,(g, œ) is nonzero is shown in 
Fig. 5.12 by the shaded region. These are bounded by the two lines œ = £, + qvr. The 
excitation spectrum of the electron gas is given by S(q, œ) in (5.167). It is nonzero when €, is 
nonzero, so that it also exists in the shaded area of Fig. 5.12. In addition, S(g, œ) has the 
plasmon peak which exists where £, = 0. It is also shown in Fig. 5.12. This figure actually 
describes the excitation spectrum of the electron gas to density fluctuations. At large values of 
q/kr, S(q, ©) is given accurately by the approximation S(q, ©) ~ &)/Ngv,, So that it describes 
the process whereby single electron-hole pairs are made. At intermediate values of q/kp, the 
excitation spectrum has the complex shape shown in Fig. 5.12. These curves can be measured 
in metals by the inelastic scattering of energetic electrons through large momentum transfers. 
However,the comparison with experiments is deferred until later sections, after several other 
dielectric models have been introduced. 
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FIGURE 5.12 The excitation region of the electron gas in (q, œ) space. The plasmon line œ, becomes highly 
damped in the region of electron-hole pairs, which is shown hatched. 
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5.5.3. Hubbard 
Hubbard (1957) introduced a correction factor to the RPA of the form 


v,P(q, ©) 


.@) = 1 —-—_* 2 | 
€y (q, ©) 1+ 0, GPO, O) (5.187) 
l q? 


The factor Gy(q) is a local field correction. There is an analogy with the Lorenz—Lorentz 
dielectric function, which has the form in cubic insulators 


(5.189) 


The two formulas have the same structure if we associate —v,P() = 4na and Gy = 3. The 
factor of 4a/3 comes from summing the local dipoles in the lattice. It is called a “local field 
correction”. Similarly, Gy(q) in the Hubbard dielectric function comes from summing over 
the surrounding structure of the dielectric, which in this case is the exchange-correlation hole. 

This curious formula was regarded as an improvement to the RPA in many properties. 
Yet its true worth was unappreciated, because it could not be compared with the better 
dielectric functions which only became available later. The best dielectric functions today are 
written in precisely the form (5.187), with a G,(q) which is only slightly different from the 
simple form proposed by Hubbard. His result, which was somewhat of a stab in the dark, is 
now regarded as being well ahead of its time. 

The factor Gy(q) is introduced to account for the existence of the exchange and 
correlation hole around the electron. The dielectric function describes how the conduction 
electrons of the metal rearrange their positions to screen the Coulomb potential. Because of 
the exchange and correlation hole around each electron, when one electron is participating in 
the dielectric screening, others are less likely to be found nearby. The exchange-correlation 
hole should have some affect upon the nature of the dielectric screening. 

The Hubbard local field comes from the vertex corrections in the polarization function 
P(q, œ). The most important vertex corrections are the ladder diagrams in Fig 5.13. They can 
be evaluated by introducing a vertex function [(k,k+q) in a four vector notation 
k = (k, ik,). This approximation to the polarization diagrams is denoted P; where “L’ 
denotes ladder 


P,(q, ©) = SE GO (KYGO(k +lk, k +9) 
1 
By 


(5.190) 


Tk, k+q)=1-—Y Wk -p) 9 WG P+ OTR. p +q) 
P 


The above expression is rather difficult to evaluate exactly. The degree of complexity depends 
upon the choice for the effective interaction W (k — p) which are the ladders. 

In the Hartree-Fock approximation, to the ladder diagrams, the function W (k — p) is the 
unscreened Coulomb interaction W (q) = v4. Since the interaction has no dependence upon 
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FIGURE 5.13 Ladder diagrams of the polarization bubble. 


frequency, neither does the vertex function I (k, k + q) depend upon ik,,. In this special case 
the frequency summation ip, is simple and the above expression is 
— "p+ 


I(k,k + gm) =1-| © v B + q: igp 
( q: igm) a TP. p Tind E 


This expression has been evaluated by Sham (1973) and by Brosens et al. (1977). They also 
added the exchange self-energies X,(p) — Ł,(p + q) to the energy denominator. In this case 
the theory is called “conserving” (Baym and Kadanoff, 1961) in that the vertex corrections 
and self-energies are both evaluated to the same level of approximation. This dielectric 
function has some interesting structure. 

A better approximation is to have W(qg) be a screened interaction such as 
W(q) = v,/e(q). In this case the dielectric function e(q) depends upon frequency (iq) which 
complicates the evaluation of the integral equation for the ladder diagrams. An approximate 
result is obtained by making some drastic approximations. The first one is to assume that 
I (k, k+ q) © I(q) and the k dependence is ignored. This approximation signficantly reduces 
the complexity of the analysis, since the equations now have the form 


P(g) = T(P) (5.191) 


rO =1-T@A@ = KG (5.192) 
AQ) == EWE -Ipp a) (5.193) 


By ‘> 


There is a problem with the last expression. If '(qg) does not depend upon (q), then neither 
can A(q). Therefore, the factor of W(k — p) must have its argument replaced by another one. 
Here is where Hubbard made his important approximation. He replaced W(k — p) by a 
statically screened interaction 


Ane? 
W(k —p) > ———— 5.194 
(k —p) P+ dr ( ) 
2ne ai) (1) 
A(q) = z P (4) = 0, Gu (QP (q) (5.195) 
q? + dor 
q? 
Ga (q) = -———— 5.196 


Once the factor of W (q) is removed from the summation over p, then that summation just 
gives 1 P‘(q). The factor of 4 comes because P) contains a summation over spins which 
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multiplies the result by two. The ladder terms do not have a summation over spin states. Each 
electron line in the polarization bubble diagram has the same spin configuration. So the result 
is multiplied by 4 to account for the factor of 2 in P). Note that the derivation gave a 
Hubbard factor with a denominator of (q? + q7,-) rather than the original Hubbard expression 
of (q? + k2). Actually, Hubbard waffled, and published both results in separate papers. 
Another time he suggested 


l q? 
This latter result is remarkably close to the best theories today. 

Why do the ladder diagrams come from the exchange-correlation hole? The polarization 
diagram is a bubble with two electron lines. One is going forward in time (“electron”) and 
one is going backward in time (“hole”). The hole is an excitation from below the chemical 
potential. The ladder diagrams introduce scattering, and therefore correlations, between the 
electrons and holes. The correlation hole around the electron must be created by the low 
energy excitations of the electron gas, which are just the electron-hole pairs. The correlation 


hole is due to correlations between electrons and holes. 


5.5.4. Singwi-Sjolander 


The 1967 Singwi—Sjélander dielectric function has the same form (5.187) as Hubbard’s 
but with a different choice of G,(q). Singwi has collaborated with a variety of authors to 
develop improvements in the method of choosing Gy(q). Once could remark, in a humorous 
way, that G;,(q) is time dependent because of its improvements over the years. This variety 
makes the discussion difficult, since there are several possible choices to describe. The 
original version of Singwi-Sjélander is derived, and the others are cited in the references (see 
Vashishta and Singwi, 1972; Singwi and Tosi, 1981). 

Their original derivation is very attractive, because it explicitly includes the exchange- 
correlation hole. They derive an equation of motion for the screening charge and then insert 
the pair distribution function g(r) into the Coulomb integral between particles. They solved 
these equations and obtained 


v,P(q, @) 


e(q, œ) = 1 — ———___.—~ 
1 G PO(q, 
me H(g)P\(q, ©) (5.198) 
o lf kako yy 
Gua) = -y E p Sa-w- 


where S(q) is the static structure factor associated with g(r). Their original derivation employs 
Wigner distribution functions. An alternate derivation is presented here which was given by 
Singwi et al. (1968). It uses equations of motion. They find the second time derivative of the 
particle density operator p(q, 7) and derive an expression for the plasma frequency. The 
plasma frequency is given by e(q, œ) = 0. Remember that at high frequency v,P ~ w/o’, 
then using the equation for the dielectric function gives 


2 


qv 
—~0=1-—l1- 4 F , 
€ ral G,(q) + o( =i )| (5.199) 


Corrections to the plasma frequency can be associated with the Hubbard local field Gy. 
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The first time derivative is a single commutator with H, so the second time derivative is a 
double commutator, 


p(q, t) = —[H, [H, plq, Dl (5.200) 


p(q, t) = ro ptq.oC] (5.201) 


where H is the Hamiltonian for the homogeneous electron gas. Consider the first commutator, 
[H, p(4, D] = DEpig — &p) Ch +q,0Cpo (5.202) 
po 


where p(q, t) commutes with the electron—electron term in the Hamiltonian. That is not 
surprising, since electron—electron interactions can be written as dq vgP(q)p(—q) except for 
self-interaction effects. An operator commutes with itself. The electron—electron term 
contains just density operators, which is why it commutes with p(q, t). The second 
commutator provides the interesting result. Another commutator with the kinetic term in H 
just gives a repetition of the energy difference factor: 


y 
È Ek sCh Cis: 3 (Ep+q ~ EC aC fo} „|= 2(Eptq — p) Cp+q,o Cpo 


The commutator with the interaction gives four terms, which can be combined to give two 
different ones by rearranging the order of operators. They are 


t 
E D (pta — Sp¥g lpg, Cigs Crs Cp Cita oCpo] 
pp 'q' 5,5',0 


-= X (Epa — Sp Pg lCprqtq'oCk—q.sCksCpo — Chrao Ch-g,sCksCp-g,ol (5-203) 
pq'so 


K. 


The change in dummy variables p > p + q’ in the last term makes the operator sequence the 
same as the first term: 


=>, Vgc "gta oCk-qi,sCksCpol(Ep+q — Ep) — (Epsqta’ — Eptq’)] 


The kinetic energies largely cancel in this expression, The bracket is simply —q-q’/m. In 


writing this term, the operator sequence c! tata’ Ciy, sCksCpo Will be rearranged into 
ch tatg 'oCpoCk _q,sCks: After summing over internal variables, this expression becomes 


plq + q’, p(—q’, t). The rearrangement is permissible as long as the term is omitted which 
has a particle acting on itself; a self-interaction is included in pp unless care is taken to leave 
it out. The various terms are collected into the equation of motion for the density operator: 


—p(q, t) = 2 (Ep+a 7 Ep) ChigcCpo (5.204) 
q’ 
+- > Vg (= LL) oq + q’, t)p(—q’, t) (5.205) 
An obvious way to evaluate the last term is to take q’ = —q in the summation. This term 


alone gives the equation [p(0) = N,] 


2 
, vq N, 
Ba, ) = 2 -—* plq, 1) = PCa, 1) (5.206) 
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which describes the plasma oscillations in the electron gas. Since the other terms all vanish 
when q — 0, it has been shown that the long-wavelength density oscillations are the same 
plasma oscillations as predicted in the RPA. The RPA is exact in this limit. 

The Singwi-Sjélander result is obtained from treating (p(q + q’, t)p(—q’, t)) more 
carefully. They wrote it in terms of a direct summation over particle positions, as was done in 
Sec. 1.6: 


pat DPE = Leen Yew (5.207) 

JAi 
The term i = j is excluded, since it is a particle interacting with itself. The factors in the 
exponent are rearranged to have one summation over a particle r, and another summation over 


the particle separations r; — r: 


o(q+q \p(—q/ )= > eit > elit) (q+q’) (5.208) 
j i$j 


By following the definitions in Sec. 1.6, the summation over r; — r; can be replaced by its 
average in the electron gas: 


> XD => mo Ja rge D = N bgg HSA Hq')- 1 
iŻj 
The other summation Y` e’4°" is just p(q, £), so that 

(Pla + q')p(—q’))’ = (Pla, DNS + S(Q+q/) 1] (5.209) 


The prime on the bracket means, in doing the averages, to omit the interaction of a particle 
with itself. This omission produces the —1 term on the right. The term N,0,;q is the same 
one included in the above derivation, which gave plasmons. The other term gives G;,(q). It is 
already proportional to p(q, £), so that Gy(q) is obtained without difficulty. The electron- 
electron interactions give a term 


1 —q-q’ 
(p(q, t)) > Uy (= q \INBas4 + S(q +q) — 1] 
q’ 


= w(p(q. DU — Gula) (5.210) 
d? 
Gy(q) = males 4 isa+q)- 1 


œ p(q, t) = > (Ep+4 — €p) (C Bta, o Cpo? + o7 (p(q, [L — Cula) (5.211) 
po 


The factor of Gy(q) provides a modification of the plasmon dispersion at nonzero q due to 
electron correlations. The formula for Gy (q) is the same one which was announced in (5.198) 
after changing dummy variables k = —q’. 

The Singwi-Sjélander formula for Gy(q) can be evaluated with only a knowledge of 
S(q). It, in turn, is obtained from a knowledge of e(q, œ) through the relationship (5.117) 


derived earlier: 
1 (“do 1 
S(q) = — — l — im| | 
Nog Jo T €(q, œ) 


But e(q, œ) depends on Gy(q). The three equations (5.198), (5.187), and (5.117) form a triad 
which link the three functions G;,(q), S(q), and €(q, œ). They are solved self-consistently on 
the computer, which must be done for each value of r,, since the results depend on density. In 
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TABLE 5.2 Constants A and B in the Hubbard local field 


Vashista and Singwi (1972) Singwi et al. (1970) 


A B A B 
1 0.70853 0.36940 0.7756 0.4307 
2 0.85509 0.33117 0.8994 0.3401 
3 0.97805 0.30440 0.9629 0.2924 
4 1.08482 0.28430 0.9959 0.2612 
5 1.17987 0.26850 1.0138 0.2377 
6 1.26569 0.25561 1.0216 0.2189 


discussing some later versions of G,,(q), they (Singwi et al., 1970; Vashishta and Singwi, 
1972) remarked that their results could be adequately fitted by the simple expression 


Gy(q) = A{1 — exp[-B(q4/4rr)]} (5.212) 


The constants A and B, both dimensionless, are given in Table 5.2 for different values of r,. 
Since they depend smoothly on density, the parameters A and B may be obtained for other 
values of r, by interpolation. This form for G;,(q) fits their computed one well at small and 
intermediate values of g/k, but not at larger values. However, Gy is relatively uninportant at 
large g, so this drawback is not serious. In the next section a number of properties of the 
electron gas are calculated using all four dielectric functions. In general the accuracy 
increases in the order they were present: Thomas—Fermi is least accurate, while Singwi— 
Sjölander is the best. 


5.5.5. Local Field Corrections 


The Hubbard local field correction is actually a function of frequency G;,(q, œ). The 
frequency dependence is much less well known than the dependence upon wave vector. 
Numerous theorists (Niklasson, 1974; Santoro and Giuliani 1988; Vignale, 1988) have 
derived the exact limit of G,(q, œ) and G,(q, œ) in the limit of large q. They are shown in 
Table 5.3. The local field for spin correlations G,(q, œ) is introduced later in the chapter. In 
taking these limits, the result depends upon the behavior of œ while q — oo. The case for 
œ = 0 and hw = g, give the same limit in two (2D) and three (3D) dimensions. The case that 
œ Æ 0, but not diverging, give different results in 2D and 3D. Here g(0) is the pair distribution 
function at r = 0. The usual method of deriving these results is to derive equations for the 


TABLE 5.3 Exact asymptotic limits (q — oo) of the local fields 

for charge G,(g, œ) and spin G,(q, œ). The first row is for œ = 0 

the second row is for nonzero œ. The third row scales hœ with £,. 
From Santoro and Giuliani (1988, used with permission) 


3D 2D 
Gy Gs Gy Gs 
o = 1 — g(0) g(0) 1 — g(0) g(0) 
a #0 Z0- e0] 3[4e0)-1] 1-g0)  g(0) 


hw = &, 1 — g(0) g(0) 1—g(0) g0) 
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response functions using the equations of motion. The correction terms vanish in the limit of 
large wave vector and frequency, so that it is possible to obtain exact limits. They find at large 
q, and for œ Æ 0, the asymptotic limits 


.a’\2 / 
Gy (q, œ) = >E 2 ogee) ~ ilis) — | 


e q oo’ q v(q) (5.213) 
G (q-q’) vq’) | n 
s\q, o) = _ IN, > 2 ro sgn(oo ) [Soo lq) Ôso'] 
Soo (0) = = (Pa(—WDPa(A)) -Eqo (5.214) 
ae p+a.o€ (5.215) 


These formulas have the flavor of the Singwi—Sjélander theory, which is not surprising since 
they were derived in a similar fashion. In practice, the local field corrections are rarely needed 
in these limits of large variables. However, exact results are always useful as benchmarks for 
approximate theories. 

Another method to obtain information regarding the local field corrections is from 
experiments. Larson et al. (1996) have determined the local field correction Gy(q) in metallic 
aluminum from an analysis of x-ray scattering. The scattering measures S(q, œ) for a fixed 
value of q as a function of œ. This data is compared to the theoretical result which is written 
in the form 


1 
S(q, @) x —Im aaa (5.216) 


The RPA polarization P‘)(q, œ) was calculated by Fleszar et al. (1995) using the accurate 
numerical wave functions for the energy bands of aluminum. For each value of q, the quantity 
G;;(q) is treated as an adjustable constant to improve the agreement between the computed 
and experimental values. Repeating the experiment for many different points q gives a plot of 
G(q) vs. q, which is shown in Fig. 5.14. There is reasonable agreement between the 
measured values, and the predictions for the jellium model. 
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FIGURE 5.14 Measured local field function Gy(q) in metallic aluminum by Larson et al. (1996). Also shown are 
several theoretical curves based upon the homogeneous electron gas (used with permission). 
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5.5.6. Vertex Corrections 


The Hubbard local field factor Gy(q) was derived as a vertex correction to the polar- 
ization diagram of the electron gas. This vertex correction is denoted as I’;,(q), and the 
Hubbard form of the dielectric function is written as 


éy(q) = 1 —v,P(@l x(q) (5.217) 


r,(q) = 


1+ 0,G,(@PO@) (5:218) 


Where else might vertex corrections enter into the calculations of the electron gas? 

The first point is to distinguish between the interaction between two impurity charges 
Z2 in the electron gas, compared to the interaction between two electrons. The impurities 
were assumed not to be electrons. The above screening function is appropriate only for the 
interaction between two protons in the electron gas, or between other charged ions. 

Additional vertex corrections are required when electrons interact among themselves. In 
some cases one factor is required, while at other times two are required. This confusing 
situation is now discussed. The first case is the calculation of the screened exchange energy 
2} of an electron. The self-energy is 


Aq Tuld idm) 

£. (k, ik,) = —— = v, EF GOK + q, ik, + idm) (5.219) 

l D ony" elig)  ““*4 

€(q, idm) = 1 — vaP” (q, idm) H(A, idm) (5.220) 

l 

Daulq, idm) = — ooo 7 5.221 
nA im) =T y GOPA tan) (5:220 
raj 

H(A igm) _ l (5.222) 


The integrand contains a single vertex function I y(q, ig,,). The vertex function should 
depend upon the k variable. In four-vector notation, it is a function of I (k, k + q). Neglecting 
the k dependence is an approximation. All dielectric functions of the Hubbard type, with a 
G,,(q) factor, make this approximation of neglecting the k dependence of the vertex function. 

Figure 5.15 shows the vertex corrections of the screened exchange energy. The wiggly 
line denotes the interaction W(q) = v,/€(q), where e(q) is the Hubbard dielectric function. 
The first diagram has a single wiggly line and corresponds to the interaction v,/e(q). The 
second diagram has two wiggly lines. Because of the symmetry, the diagram could be a vertex 
correction to either end point. Since the diagram occurs only once, it cannot be a vertex 
correction to both end points. Keeping only the ladder diagrams, then the vertex corrections to 
the first end point are shown by several more terms in the series. The summation of these 
terms, the vertex correction, is shown in Feynman diagrams as a shaded triangle. 

The key point is that only one end of the large wiggly interaction has a vertex correction. 
Do not put a vertex correction at each end, since that is overcounting. In Fig. 5.15(b) is shown 
two Feynman diagrams. The first has a vertex correction at each end of the large wiggly line. 
This term seems to suggest a correction for each end. However , the second Feynman diagram 
in Fig. 5.15(b) is topologically identical to the first. The only difference is that a different 
wiggly line has been drawn “large”. That diagram can be considered to be a vertex correction 
to the first end point, where one of the ladder diagrams itself has a vertex correction. In fact, 
the correct way to include all ladder diagrams is to have each ladder also have a vertex 
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(a) 


(b) 


FIGURE 5.15 Vertex corrections to the screened exchange energy. 


correction at each end. This set of Feynman diagrams has fractal character (Mahan, 1993). 
Writing the equations for all ladder corrections makes it obvious that the vertex correction 
only goes on one end in the screened exchange energy. 

There is another way to show the correctness of Eq. (5.219). Ting et al. (1975) showed 
the vertex function I y enters only once in effective interaction. Furthermore, they also 
showed the same vertex function enters the numerator as enters the dielectric function. 

Ting et al. (1975) derived (5.219) by starting from the definition of the ground state 
energy of the interacting electron system 


1 3 
V | d’q | 
E, =E -5l — —— 1 5.223 
6 20 2B 07 (2n)° | iq) | l ) 
nv,P®(q, iq) 
1 + nv, Gy(q)P(q, iq) 


The derivation of this formula starts by defining the ground state energy as the expectation 
value of the Hamiltonian (H). The term E,o is from the kinetic energy. The second term is 
from the expectation value of the electron—electron interaction (p(q)p(—q)). The latter is 
related to the inverse dielectric function using (5.112). The coupling constant n has been 
made dimensionless by having n = 0 at e = 0 and n = 1 at e. Gy(q) may also be a 
functional of the product ny, where % = v,P®. 

The self-energy can be obtained as the functional derivative of the ground state energy 
with respect to the occupation number n,. For example, the kinetic energy term can be written 
as 


€,(q, ig) = 1 (5.224) 


Eo = > Enno (5.225) 
po 
OE 
g0 
n TE (5.226) 
_. 8 
(Ps Py) = <= — by (5.227) 


po 
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The screened exchange energy is the derivative of the energy in (5.223). Assume the only 
dependence upon npo in the integrand is in the susceptibility x. The functional derivative has 
the form for f = 1/e(q) is 


5 Sf 6PY 
f v, L (5.228) 
Ngo OX Ny, 
] l | 
V 3 P (z _ Cp+q + lan Ép — Čp+a E z) 
òP) 1l 
s = IG +d, €p + ign) + IMD + q, &p — ign) (5.230) 
po 


The two Green’s function terms in 5P()/ Oy, contribute equally, since e(q,ig,) is a 
symmetric function of complex frequency ig,. This identity removes the factor of 2 in front of 
the interaction term in (5.223). The two steps of coupling constant integral and functional 
derivative can be arranged to cancel. Introduce the variable y = ny, which becomes the 
variable of both integration and differentiation: 


[arw if gyro] ] 
on ay XJo ` dy 


When f(x) = 1/e = 1/(1 —Tx) then f(x) —f(0O) = xI/s. Collecting all of these results 
produces (5.219). This derivation is independent of the particular form for the vertex function 
I’. It is also valid for any form for Gy(q) as long as one assumes that G;,(q) is a functional of 
the susceptibility y. This derivation makes it clear that the screened exchange energy has only 
a single factor of the vertex function in its effective interaction. 

There are occasions where an effective Coulomb interaction between two electrons has a 
double power (T$) of the vertex correction. The ladder diagrams of the polarization diagrams 
could themselves have vertex corrections at each end of the wiggly line. The wiggly line is 
still the interaction v,/e(q). An equation for the vertex correction in this case is, using a four 
vector notation 


T(p,p+q) =1—- Da G 7 


x rep tgptgtQ9Optg9y@rqtd) — (5.232) 
In the approximation of taking I (p, k) + I'(p — k) then this equation simplifies to 


ra 


~ f(x) -f0] (5.231) 


Tq) =1-T@ a) w Il gp + )\G(p ++) (5.233) 


The effective interaction for the ladder diagram has a double vertex correction. A double term 
always happens when two different electrons are interacting. A single vertex occurs when one 
electron interacts with itself through the average polarization of the media. 

The effective interaction in the screened exchange energy is given in Eq. (5.222). The 
Coulomb interaction v, is multiplied by the factor of [1 — G,(q)|. This expression is often 
written down as the effective dielectric function. In writing it this way, a vertex correction has 
been included. If G;,(qg) = 1 for some value of q then the Coulomb interaction is effectively 


unscreened. In the x-ray scattering from aluminum, that occurs at q = 1.65k, (Larson et al., 
1996). 
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5.6. PROPERTIES OF THE ELECTRON GAS 


There are a number of properties of the electron gas which may be predicted with a 
knowledge of the dielectric function (q, œ). These properties may also be measured for 
metals. However, one should be careful in applying predictions of the homogeneous electron 
gas to real metals, because the ion cores often have a significant influence. A favorable 
comparison to experiment may not, in his case, be the signature of the best theory. Four 
properties are discussed: the pair distribution function, the screening charge, the compres- 
sibility, and the correlation energy. 


5.6.1. Pair Distribution Function 


The pair distribution function g(r) may be obtained from a knowledge of the static 
structure factor S(q) through the transform relation (1.445): 


CO 

g(r) =1+ =| kdk sin(kr)[S(k) — 1] (5.234) 
2rkè Jo 7 

The static structure factor S(k) is obtained from the dielectric function through the relation 

(5.117). The Thomas—Fermi theory is a static model of dielectric screening, so that it has no 

prediction regarding g(r). So g(r) is calculated using the theories of RPA, Hubbard, and 

Singwi—Sjolander. All results are obtained only after a numerical computation. 

The three results for g(r) are shown in Fig. 5.16 for six values of r,. The RPA and 
Hubbard theories are negative at small values of r. This behavior is viewed as a great defi- 
ciency, since g(r) is, by definition, strictly a positive function. The pair distribution function is 
defined as the probability that another electron is a distance r away from the first; as a 
probability, it must be positive. The negative values predicted by the RPA and Hubbard 
theories are regarded as significant failures of these theories. The Singwi—Sj6lander theory is 
slightly negative at large r, but otherwise positive, so it is much better in this respect. For most 
metallic densities, it predicts g(r) to be positive for all r. The Singwi—Sjélander theory is a 
significant improvement over the earlier theories. 

Kimball (1973) proved that 2’(0) = g(0)/ay where ap is the Bohr radius and g’ = dg/dr. 
His proof is based upon the assumption that when two electrons become very close together, 
the other electrons can be ignored and it becomes a two-body problem. The relative wave 
function is similar to the hydrogen atom, but with a repulsive interaction. The same result was 
obtained more rigorously (i.e., including the other electrons) by Vignale (1988). 


5.6.2. Screening Charge 


The screened potential V(r) about a point charge Zed(r) may be calculated and plotted. It 
is slightly inconvenient, since V(r) diverges as r~! at small r, as do all Coulomb potentials. 
Instead, it is customary to plot the density of screening charge n,(r) about the point impurity. 
This quantity is nonzero at the origin r = 0 and also has the Friedel oscillations at large 
distance. In the linear screening model, derived in Sec. 5.4, the potential energy from an 
impurity charge of Ze is | 


dq Vg 


ny aq) a'r (5.235) 


V(r) = -z| 
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Operate on both sides of this equation with V?. From Poisson’s equation, V7 V is proportional 
to the excess charge density in the system. There is charge from the central impurity Zed(r) 
and the screening particle density n,(r). The equation for this screening charge is: 


dP iq'r 
VV = 4ne[Z8(r) — n.(r)] = 4ne2Z | om 5 (5.236) 
oz) ËI irf 
n(r)=Z l np e'i Í 5 (5.237) 


The distribution is spherically symmetric in the homogeneous electron gas, so the angular 
integrals may be performed at once: 


OO 


Z ] 
n(r) = 2; | gdp snar) 1 — a (5.238) 


In this case a knowledge of the static dielectric function &(q) is sufficient to determine the 
density of screening charge. The Thomas—Fermi model may also be used to make predictions. 
It was shown in Sec. 5.5.1 that it predicted the screened potential energy to be 
Ze? exp(—qr,rr)/r, and the screening particle density is predicted to be Zq7p exp(—qrpr)/r. 
This result is not plotted in Fig. 5.17. The rest, which are shown in Fig. 5.17, have to be 
obtained by numerical Fourier transform, which was first done by Langer and Vosko (1960) 
for the RPA. All curves are very similar, except for Thomas—Fermi, which is the only one to 
diverge at r > 0. The RPA (1), Hubbard (2), Singwi-Sjolander (3), and Singwi et al. (4) 
results all show Friedel oscillations which are similar. They are lacking in the Thomas—Fermi 
theory. The Friedel oscillations are real features of impurities in metals, so that the Thomas- 
Fermi theory is deficient in several respects. However, the other theories predict remarkably 
similar results, which should not be surprising. Screening is basically a one-body property, 
and little correlation is evident in one-body amplitudes. The effects of Gy(q) are much more 
apparent in properties involving two-body correlations. 


5.6.3. Correlation Energies 


A knowledge of &(q, œ) is sufficient for predicting the correlation energy of the electron 
gas. This fact should be expected, since the correlation energy arises from the correlation hole 
around the electron. The exchange-correlation hole was put into the derivation of Gy(q). Now 
e(q, œ) is manipulated to obtain the correlation energy. The discussion follows Singwi et al. 
(1968). 

The Coulomb interaction energy of the electron gas may be written as 


1 
z Ža v,P(a@)p(—q) (5.239) 


This way of writing the interaction allows an electron to interact with itself. These contri- 
butions should be omitted. The average value of p(q)p(—q) in the electron gas is 


(PDPC D) = NAN -Sq=0 + S4) — 1] (5.240) 


The prime on the bracket means one is to omit the particle interacting with itself. It explains 
why this differs by —1 from the similar average in (5.111). This equation may be derived by 
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FIGURE 5.16 The pair distribution function g(r) for the electron gas at various densities for several model dielectric 
functions. Source: Singwi et al. (1968) (used with permission). 


following the same steps used to get (5.209). The interaction energy E,,, per electron of the 
electron gas is 


N 3 
NEn = 52 5 SO- 11 = 2neN, Eha- (5.241 
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FIGURE 5.17 Screening charge density near a point impurity in the electron gas with r, = 3 according to several 
model dielectric functions: (1) RPA, (2) Hubbard, (3) Singwi—Sjélander, and (4) Singwi et al. Source: Singwi et al. 
(1970) (used with permission). 
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The summation has been converted to an integration over q. In doing this step, remember that 
the q = 0 term is not in the summation, so that the term N.Sg-o has no effect. The quantity 
E,,4(€7) is not the Coulomb contribution to the ground state energy. To obtain the ground state 
energy per particle &,, one must do a coupling constant integration of the type discussed 
earlier in Sec. 3.6: 


dh, 


E, = 2 Ero + | T Pint (A) (5.242) 


0 
This equation may be derived starting from (3.299), which is assigned as a problem. Equation 
(5.242) is an exact result for the ground state energy of the homogeneous electron gas. One 
must calculate the Coulomb interaction energy for each value of À up to e°. In practice, this 
calculation is the same as finding S(q) as a function of density. The first term ŻE ro 1s the 
average kinetic energy, which is calculated assuming that e? = 0. 

The angular integrals in the definition (5.241) of E; „ may be done, yielding 


2 poo 
EuT aso- (5.243) 


This expression depends on e? through the prefactor on the right. It also has a dependence 
through S(q), which is decidedly dependent on electron—electron interactions. It is conven- 
tional to introduce a dimensionless function 


1 oO 
=- I dls) — 1] (5.244) 


which depends on density. In the coupling constant integration (5.242), y also depends on À. 
The interaction energy may be written as 


2 

E (e) = -ze key (5.245) 
4 (9n\'” 

—-——(—) y 5.246 

Tr, ( 4 ) Try ( ) 


The result has been put into Rydberg units in the second line by using krag = (9n/4)'” [r 
Next examine the coupling constant (e°) dependence of these terms. The Rydberg energy Ey 
is proportional to ef or 7. Similarly, r, œ à since r, « aj! and the Bohr radius is inversely 
proportional to e?. Write these dependences as r, —> Ar, or e+ —> Xet, where 0 < À < 1. The 
interaction energy in this notation is 


On 


4 1/3 
Ein) = —h— (=) (Mr, (5.247) 


The factor y is also dependent on density, which we write as y(Ar,) or as y(r,) when A = 1. 
The coupling constant integration may be done, at least formally, and the ground state energy 
per particle may be obtained from (5.242): 


22099 4 (9r 
E á p tr, \ 4 


1/3 ol 
| dny(Mr,) (5.248) 
0 


The coupling constant integration involves only the factor y. All the other 4’s have factored 
out of the expression. Of course, y was chosen so for this reason. The correlation energy 
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consists of all contributions to the ground state energy except kinetic and exchange. Thus the 


correlation energy is 
E = 0.9163 4 (=) 
r mr, \ 4 


S 


1/3 pl 
| diy(Ar,) (5.249) 
0 

One can verify that this vanishes when S(q) and y(r,) are calculated in the Hartree-Fock 
approximation. This expression is evaluated numerically after obtaining S(q) and y(r,) as 
functions of density. Some results are given in Fig. 5.18. Four curves are shown, corre- 
sponding to RPA, Hubbard, Singwi—Sjélander, and Nozieres—Pines. The last is included to 
provide a comparison with the earlier plot of the correlation energy in Fig. 5.6. The RPA 
curve predicts a correlation energy with a magnitude larger than the others. The other three 
are similar and also similar to that of Wigner. The correlation energy may be obtained with a 
knowledge of the static structure factor S(q) if it is known for all densities. The Singwi— 
Sjölander correlation energy is presumed best, because of its success in predicting a positive 
g(r). The rather direct relationship between g(r) and S(q) implies that the best g(r) is the best 
S(q), which in turn is the best correlation energy. 

The correlation energy is negative. Correlation lowers the energy of the interacting 
electron gas, and increases its binding. This result is surprising from a naive viewpoint. The 
main contribution of the correlation energy is to screen the electron—electron interactions. 
Whereas the Hartree-Fock approximation has a bare Coulomb interaction v,, the exchange 
and correlation energy has the screened interaction v,/e(q, œ). The naive viewpoint is that 
screening weakens the interaction, which lowers the magnitude of the binding energy. That is, 
if a Thomas—Fermi dielectric function is employed, then e(q) > 1 so v,/e(g) < v, and 
|E.c| < |El. The Thomas—Fermi model for the dielectric function actually predicts a positive 
correlation energy. The magnitude of the binding energy is reduced. The missing variable is 
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FIGURE 5.18 Correlation energy of the electron gas according to different model dielectric functions: RPA, 
Hubbard, and Singwi et al. Also shown is the interpolation formula of Nozieres and Pines to facilitate comparison 
with Fig. 5.6. 
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plasmons. The proper dielectric function has plasmon poles, and these excitations strongly 
couple to the electron. The interaction with the plasmons provides a significant contribution to 
the binding energy, and makes the correlation energy negative. 


5.6.4. Compressibility 


The compressibility of the electron gas is defined as follows. Let E be the total energy of 
the system, which was calculated in Sec. 5.1. It is an extrinsic quantity; i.e., it is proportional 
to the number of particles N, in the system of volume v. The pressure P is defined as the rate 
of change in £ with volume at constant N,. The inverse compressibility is the rate of change 
of P under the same conditions: 


P= -(F) (5.250) 
dv N, 

1 dP 

— = — — Ld 1 

4 (a), (5.251) 


The definition of K~! is multiplied by v so that K is not dependent on the size of the 
system—it is not dependent on N,. The compressibility is usually compared to that of a free- 
particle Fermi gas at zero temperature. There the only energy term is the kinetic energy, which 
was shown in Sec. 5.1 to be 


3H? [BRNA 
E=2E,N, =—— £ 5.252 
5HF+*e mN V ) ( ) 
Differentiate this equation twice with respect to v, while keeping N, fixed, and find 
AŽ 22/3 [Ne a 
1, h? N, [BNN T 
— = $ Ern = —— | —— 5.254 
Ky 3 PPM a (=) N ) 


The free-particle compressibility is K; = 3/(2Epno). Most of the numerical results are 
presented as the ratio K;/K, where Ky is the above result. An effective mass of unity is 
assumed. 

This procedure for finding the compressibility may be generalized. Ignoring surface 
effects, the total energy of the system is N, times the exact ground state energy per particle 
E = N.E,. For a fixed N, the only volume dependence is through the density dependence of 
E,(no), where no = N,/v. Twice differentiating the total energy yields 


dE dE 
E 2 £ 
= —N,—£ = n — 2 
P N; dy No No (5 55) 
1 dP\ —— aP(m) naom | 3 
K = (a), No dn, = dnok, + nEs (5.256) 
2 
=n Ë (mE,) (5.257) 
dno 


The last line shows that the compressibility may also be represented as the second derivative 
of the quantity f (nọ) = noE,. Since E = NE, = vf, f (no) is just the energy per unit volume. 
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The compressibility is easily found as the second derivative of this quantity multiplied by nê. 
The calculation of the ground state energy E, was described in Sec. 5.6.3. Using this method, 
one may find f (nọ) and its derivatives numerically. 

Another relation for the compressibility may be obtained by using the theorem of Seitz, 
which was given in (5.34): 


df (no) 
= 2 5.258 
u dn, ( ) 
1 > du 


Here u is the chemical potential of the electrons, evaluated ignoring surface effects. The 
compressibility is related to the change in chemical potential with density. In some cases it 
may be easier to calculate the properties of a particle on the Fermi surface rather than the total 
energy. In particular, only a single derivative is needed to evaluate (5.259). 
As an example of using this formula, the compressibility is calculated in the Hartree— 
Fock approximation. The starting point is the energy of an electron at the chemical potential 
272 
= nke eke (5.260) 
2m T 
It is a quadratic equation for kp(u). Solve it, and then find the electron density from 
no = kè /3n?, where ay is the Bohr radius 
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The Hartree-Fock prediction is that the inverse compressibility is a linear function of r,, and 
becomes negative around r, = 6.02. Other predictions are given below. 

Another method of evaluating K is through the compressibility sum rule. This rule is an 
exact relationship between the compressibility and the long-wavelength limit of the static 
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dielectric function (see Nozieres, 1964). Take the longitudinal dielectric function &(q, œ), set 
œ = 0, and then take the limit q — 0. In this limit, one obtains 


Ane? 
lim e(q, 0) = 1 + =£ nK (5.266) 
q—>0 q 


By substituting the free-particle compressibility Ky = 3/(2nEp), the compressibility sum 
rule may also be written as 


iia K 
li „0 = 1+5 ji .26 
lim e(q, 0) + P AK; (5.267) 


The RPA predicts K/K; = 1, since &gp, > 1 + q7-/q’. It also provides a method of relating 
K to G,,(q). If one takes the limit q — 0 for the dielectric function (5.187) and remembers 
that v,P\) + —q},/q’, then 


v,P® 
lim e(q, 0) = 1 - ——— A (5.268) 
q—>0 1 + Glg), P® 
2 2 
-1p ÉI (5.269) 


1 — qi Gy(q)/¢" 


The function Gy(q) vanishes as O(q?), so that the limit Gy(q)/q? yields a constant, which is 
called a/k2. The constant a is dimensionless. The Hubbard theory predicts that a = L, so the 
compressibility is 


Ke Gy(q) 
f 2 Vad | _ {TF 
K 2 
(2) — | -5() -1-5 (5.271) 
K Hubbard 2 kpr 3.01 


The constant a may also be obtained from the Singwi-Sélander theory, where Gy(q) is given 
by (5.198): 


d'k q'k 
2n) ke 


G,(q) = — =| [Sa - k) — 1 (5.272) 


Change variables of integration to Q = q — k, and evaluate the expression in the limit where 
q — 0: 


3m q(q — vO) 
E A dv ge NG 
q? 
— -76 | QO’ dO[S(Q) — nf dv ae — 2v*) + ona") 
q? 
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The combination of factors on the right is the same as in the definition of y in (5.244). The 
Singwi—Sjolander theory predicts that a = y or that 


Af (=) (5.274) 
K e 

The value of a may be deduced from Table 5.2, since the limit q —> 0 yields a = AB, where A 
and B are the constants listed. This value of a varies with density, but it is generally smaller 
than the Hubbard value of 4 and is nearer to 4. 

Two ways have been given to evaluate the compressibility. The first is by finding the 
ground state energy per particle and differentiating twice. The second is from the long- 
wavelength limit of the dielectric function. The two methods would give the same result if all 
quantities could be found exactly. But when using approximate theories for £€(q, œ), the two 
methods of finding the compressibility will give different results. For example, consider the 
compressibility in the Hartree-Fock approximation. The ground state energy now includes the 
exchange term, and a second derivative of the ground state energy gives a = l as derived 
earlier. But when the exchange hole is used to get g(r), as at the end of Sec. 5.1, and then 
S(q), and then Gy(q), and finally a, a different result is found: a = 3. The Hartree—Fock 
approximation predicts different results in the two methods of calculation. The form of G,(q) 
suggested by Vashishta and Singwi (1972) and Schneider et al. (1970) has the virtue that it 
gives almost the same compressibility by both methods of calculation. Denoting their func- 
tion as G,,(q) while G(q) is the function in (5.198), they are related by 


d 
GQ) = (: + 2ng x) Gq) (5.275) 


The derivative is taken with respect to density. This form is regarded as being particularly 
accurate for the compressibility. 

Figure 5.19 shows some theoretical compressibilities as a function of r,. The solid lines 
are results obtained by differentiating the ground state energy. All the theories except Hartree— 
Fock predict identical compressibilities when evaluated in this fashion. And the Hartree-Fock 
result is not very different. Presumably the curve labeled RPA, SS (for Singwi—Sjélander), 


COMPRESSIBILITY 


FIGURE 5.19 The compressibility K of the electron gas compared to that of the noninteracting gas Ky. The five 
theories shown are Hartree-Fock, RPA, Singwi—Sjélander, Hubbard, and Vashishta—Singwi. Each theory makes two 
predictions: the solid line from the derivative of the ground state energy and the dashed line from the compressibility 
sum rule. 
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Hubbard, or VS (for Vashishta—Singwi) is the proper result, since it is given by so many 
theories. 

The dashed lines are the results obtained from the compressibility sum rule: from the 
limit of e(q) or Gy(q) as q > 0. These vary widely among the theories, except for the 
coincidence of Hubbard with the original Singwi—Sjélander result (see Singwi et al., 1968). 
The Vashishta—Singwi curve is nearly identical to that calculated by the other method. 

The compressibility is predicted to go negative about r, = 5. This result suggests that the 
electron gas is unstable at densities lower than this critical value. Real metals exist with larger 
values than r, = 5; i.e., Cs has r, = 5.63, but they are not a homogeneous electron gas. 
However, metals do not exist with r, values greater than Cs, which implies that there may be a 
fundamental limit beyond which the density may no longer be reduced. Unfortunately, this 
point cannot be tested in three dimensions. 

The two-dimensional electron gas has a negative compressibility for r, above a value of 
2.2-2.3. This prediction can be tested. A quasi-two-dimensional gas can be created in a field- 
effect transistor. The electron density in the surface layer can be controlled by a gate voltage. 
In silicon devices with very high mobility, it has been found that there is a critical density of 
n, = 0.845 x 10!!cm-? (Kravchenko et al., 1994, 1995). Above this density the layer is 
metallic, while below this density the layer is insulating. The properties of the insulating state 
are still under investigation. 


5.6.5. Pauli Paramagnetic Susceptibility 


The Pauli susceptibility is another one of the many parameters of the electron gas which 
are calculated using Green’s functions. The susceptibility of a free-electron metal such as 
sodium or aluminum has a number of contributions. One is from the ion cores, and another is 
the orbital diamagnetism, which is called Landau susceptibility. These two effects must be 
subtracted from the experimental value in order to deduce the Pauli part, which is due to the 
spin of the electron. 

One way to calculate the Pauli susceptibility is by the use of the Kubo formula derived in 
Sec. 3.9.1, 


B 

Yap(4, i0) = | dre" (T.M,(q, )Ma(—4, 0)) (5.276) 
0 

MD = Ho E Sse Cptao Cpo (5.277) 


where o are the Pauli spin matrices. Up or down spin are signified as o = 1 for the s = 5 


system. Evaluate the static susceptibility, which has ia = 0. The Hamiltonian is the homo- 
geneous electron gas. Consider the type of terms which occur when expanding the S matrix. 
The first term, which has no interactions, was evaluated in Sec. 3.9.1. It gave the result for the 
free-electron gas. which is called %p = Np. The first interaction term is of the form 


1 B B 
lim x(q) = Xr — — v | d| dt 
q—0 q) Kr 2v 2 2 q 0 0 l 
x (T,M_(q, DON a. o(T1) Chg o (t1)Cko(t1)Cpo(t1)M;(—4, 0)) (5.278) 
M,(q) = Ho » SC a oCpo (5.279) 
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FIGURE 5.20 


The several connected terms which result from this contribution are shown in Fig. 5.20. The 
free-electron term is given by the bubble diagram shown in Fig. 5.20(a). The vertices are 
labeled s to indicate that they contain a spin operator and so are not the same vertex as found 
for the density or current operators. Figure 5.20(b) contains the exchange self-energy 
diagrams for the electron Green’s function. These can be incorporated by using Y instead of 
G) for this propagator, where Y contains the self-energy terms of the electron gas. Figure 
5.20(c) has a Coulomb line connecting the two propagators. It is called a vertex correction. It 
is actually a type of exchange scattering, which contributes to the local field correction G;(q) 
of Hubbard and Singwi-Sjoélander. Figure 5.20(d) has two closed loops, connected by a 
Coulomb line. This diagram is zero. It vanishes because each bubble has the correlation 
function of an M, operator and a density operator 


(T.M,(q, t)p(—q, 0)) = 0 (5.280) 


This combination is zero because of the averaging over o = +1. All diagrams with more than 
one bubble are zero, because the Coulomb interaction is spin independent. Therefore the 
correlation function is limited to terms with a single bubble, which has an s vertex on each 
end. The sum of all such contributions is indicated schematically in Fig. 5.20(e). This bubble 
diagram is called P(q, œ). It is not the same as the exact P(q, œ) which enters into the 
definition of the longitudinal dielectric function, since the latter may have contributions from 
more than one bubble if connected by two or more Coulomb lines. 

One possible approximation for P is to use the Hubbard or Singwi-Sjélander theories: 


HP” (g, io) 


P(g, io) = Ho? aio 
410) = TG (POG, 10) 


(5.281) 


As discussed in Sec. 5.5, the local field correction for spin G,(q) is different than the one 
G,,(q) for charge fluctuations. For the static susceptibility y,, = y, take the limit iœ = 0 and 
then the limit q — 0. In this limit P“ goes to minus the density of states Np = 
mkr /T?, Xp = MNF: 
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In this approximation, the ratio y/y, is similar to the ratio of the compressibilities K /Ky 
which was found in Sec. 5.6.4. However, the spin parameter a, is not the same as the charge 
parameter. 

The local field correction for spin Gs(q) has also been extensively investigated. Its exact 
value is known in the limit of large (q, œ) from Santoro and Giuliani (1988), and Vignale 
(1988). These results are presented in Table 5.3. They were derived in the same fashion as the 
similar results for charge fluctuations. 


5.7. SUM RULES 


The longitudinal dielectric function e(g, œ) of the homogeneous electron gas has some 
exact moments which are useful for checking results. Some of these have already been used 
and mentioned. For example, the long-wavelength limit is lim,_,9 €(q, @) = 1+ vgngK ; 
where K is the isothermal compressibility. Similarly, the high-frequency limit is 


o2 
: — P 
Jim &(q, œ) = 1 z (5.283) 


where œ, is the plasma frequency. The frequency œ is large when it is larger than other 
energies in the system, such as the plasma frequency or the Fermi energy. 

Another class of exact results are called sum rules. The “sums” are actually integrals 
over frequency. Several of them are 


l dows- (q, ®) = 5% (5.284) 
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These last two identities are exact results. The validity of the first two are debated. The last 
two are rigorous since there is a density—density correlation function for the inverse of the 
dielectric function, which means that its retarded function is absolutely causal. However, there 
is no correlation function which defines e(q, œ). Its retarded function is not required to be 
causal. However, it is hard to find examples where it is not. 

Approximate formulas for (q, œ) may not automatically obey these relationships. 
Indeed, approximate formulas are considered virtuous according to how accurately they 
satisfy these identities. The sign convention has €,(g, œ) > 0 for œ > 0, which implies that 


Im(1/e) = —€,/e|* is a negative quantity for œ > 0. The last two sum rules may also be 
written as 
(0 @) 
E2 T 2 
dow ——— = = 5.288 
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These exact results may be proved by a variety of methods. The third relation (5.286) is often 
called the longitudinal f-sum rule. It may be proved from the double commutator 


C = ([[H, p(a)], p(—q))) (5.290) 


where the average is taken, at nonzero temperature, over the thermodynamic states of the 
system. The Hamiltonian H is for the full homogeneous electron gas. However, the density 
operator p(q) commutes with all terms in this Hamiltonian except the kinetic energy. The first 
commutator was evaluated in Sec. 5.5.4. 


[H, p(q)] = xo p+q.oCpa(Ep+q — Ep) (5.291) 
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The term proportional to p(q) will commute with p(—q). Only the other term needs to be 
further evaluated, 


1 
= ([[Æ, p(q)], p(—q]) = m (x q' PICh 4,oCpo> x-a) 
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Llc oip- q) 'q-p: = -£Dn = -=N, (5.293) 


and yields the simple result that this double commutator is just —N,q*/m. Next, this double 
commutator is shown to be proportional to the sum in (5.286). It is evaluated by inserting the 
complete sets of states |n}, |m}, which are exact eigenstates of the Hamiltonian, and then 
collecting terms: 


C = (IH, p], p(—a)]) = fe?” (nL, p(q)] lm) (ml p(—q) In) 


— e PEm (m|p(—q)|n) (IL, p(q)]|™)} 
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Farlier in (5.115) it was shown that 
1 v 
m| oo = —n(1 — e hey 2 e Pn | (n|p(q)|m)|?5(@ + E, — En) 


so that the sum rule is 
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Compare the sum rule with the double commutator, and note that they are proportional, which 
provides the derivation of the sum rule (5.286): 
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The integral is from —oo to oo, but the negative and positive parts contribute equally, so the 
left-hand side is twice the integral from 0 to oo. 

The other three sum rules may be derived by using the Kramers—Kronig relations 
(Kronig, 1926; Kramers, 1927). Let B(w) be a function of complex variable œ, which is 
analytic in the upper half plane. Analyticity means that it has no poles or branch cuts above 
the real axis. It is also assumed that B(w) > 0 as |a| > ov. 


B(@) = al dt' B(w')P 


/ 


(5.296) 


The integral is along the real axis. On the real axis, the function B(œ) has real and imaginary 
parts, which are called B, and By, so that B(œ) = B, (œ) + iB,(@). They are related by taking 
the real and imaginary parts of (5.296): 


l o / / 
l ' / / 


These two identities are the Kramers—Kronig relations. They are useful throughout all areas of 
physics. The present interest is in applying them to the real and imaginary parts of the 
dielectric function e(q, œ) = €, + ie,. Another application is to define the refractive index 
n(œ) and extinction coefficient k(@), where ñ = n + ik = ,/e. The complex refractive index ñ 
obeys the conditions of the theorem and hence the preceding identities. There are other 
applications in optics, scattering theory, and virtually all branches of physics. All of these 
applications, including the present sum rules, assume that the dielectric function is causal so 
has no poles in the upper half plane of frequency. This assumption is not firmly established for 
€ but is for 1/e. 

The other sum rules are easy to prove as simple applications of the Kramers—Kronig 
relations. First take the case where B(w) = 1/e(g, œ) — 1. The —1 term is included to make B 
vanish as |@| — oo. Using the first Kramers—Kronig relation gives 


1 1fe l 1 


The right-hand side may be simplified by using the fact that Im[1/e(g, œ] is asymmetric in 
frequency. The integrand (—oo, 0) is evaluated by changing œ — —q@ and combining this 
with the (0, co) part to give 


1 2 [%2 1 o 
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This identity is also a sum rule but not one of the simple ones in (5.284)(5.287). However, 
they are special cases of this result. First take the limit where œ — 0: 
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The quantity e(q, œ = 0) = &(q) is real, so (5.301) expresses the static dielectric function as 
an integral over the loss function. The sum rule (5.287) is obtained by taking the limit where 
q — 0. The quantity (q) is evaluated from the compressibility sum rule (5.266), 


1 
lim e(q) = 1 + vnoK > (=) > CO (5.302) 
q—0 q 
im- = tim 2 — (5.303) 
q>oelq) q>0q? +4nenkK ' 
which shows that the left-hand side of (5.301) is zero. The terms on the right give 
© da! 1 T 
li — Im = —— 5.304 
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which is (5.287). 

The longitudinal f-sum rule was proved by taking commutators. The remaining two sum 
rules, (5.284) and (5.285), are obtained from the Kramers—Kronig relations using £ — 1 for 
B(w). Again the 1 term is included so that £ — 1 vanishes at large values of |@|. From (5.298) 
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0 
The second equation here is derived using the fact that £- (q, œ) is antisymmetric in frequency. 
This equation is evaluated for œ = 0: 
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Equation (5.307) is an exact relation between the imaginary part of the dielectric function 
€,(q, œ) and the static part ¢,(qg). In the limit where q — 0, the compressibility sum rule 
(5.302) gives e; = 1 + v$K, so that 


Od / 
lim | — e (q, œ) = Ta nK (5.308) 
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which establishes (5.285). 
The remaining sum rule (5.284) is found from the limit œ —> oo in (5.306): 
: 2 9 TAN A / l 
dm €5(q, 0) = 1 — al dm œ (q, © ) + (=) (5.309) 
By using (5.283), the left-hand side becomes 1 — @?/m*, and equating terms of order œ~? 
produces the result 


| do’ w'e,(q, œ -7o (5.310) 
0 


All four sums rules have been demonstrated. These are not the only sum rules which exist. 
Others are sometimes found useful, in particular the third moment (Mihara and Puff, 1968; 
Hopfield, 1970). 

It is curious that the first moment sum rules of both €,(qg, œ) and Im[1/e(q, œ)] yield the 
same result (7/ 2)o%. And this result is independent of q. One should not conclude that £, and 
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Im(1/¢) are identical. Earlier, in Fig. 5.11, it was shown that these two functions are similar at 
large q but are quite different at small values of q/kp. 

Plasmons play a particularly important role in these sum rules at small values of q. In the 
limit where q — 0, from (5.170) then 
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For a retarded function, let œ —> œ + ið, so that the imaginary part of this expression gives 
, l TO, 
lim Im G, ©) = — Zz" [blo — @,) — 0(@ — @,)] (5.314) 


This relation is rigorously valid only in the limit where q —> 0. Nevertheless, it is curious that 
it satisfies the last two sum rules exactly: 
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This result shows that plasmons provide all the contributions to Im[1/e(q, œ)] in the limit 
where q — 0. Since the interpretation of Im[1/e(q, @)] is that it describes longitudinal 
excitations of the electron gas, plasmons are the only excitation in the limit where q — 0. 
Any other excitation would make a contribution to the sum rule, which is impossible since 
plasmons provide the entire result. Any other excitation has zero strength in the limit where 
q— 0. 

The single-mode approximation is to assume the spectral function is given by a single 
excitation @,(q). The approximation is only accurate in the electron gas at small values of 
wave vector q, but is often used for all wave vectors. It will be used again in Chapter 11 in 
describing the excitations in correlated liquids. 


5.8. ONE-ELECTRON PROPERTIES 


This section discusses the single-particle properties of electrons in the homogeneous 
electron gas. They include the effective mass and mean free path. The present section is put at 
the end of this chapter deliberately to make a point: the one-electron properties are not very 
important for discussions of most features of the electron gas. The discussions of ground state 
energies and pair correlations did not rely significantly on the properties of the individual 
electrons. The excitation spectra of the electron gas are also not very dependent on the 
properties of single electrons. In fact, the converse is true: the collective properties of the 
electron gas need to be known in order to discuss the behavior of a single electron. This single 
electron interacts with the other electrons, which are represented as collective excitations. The 
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order of presentation has been to first describe the collective excitations. It was done assuming 
that the particles were nearly free (i.e., RPA) or had some correlation (i.e., Hubbard, Singwi— 
Sjölander). These calculations ignored the finite mean free path, or possible effective mass 
changes, of the electron. They do not change these collective properties to a significant 
degree, as long as the mass changes are small, and the mean free path is long. 

The original assumption is that electron states in the electron gas are noninteracting 
plane waves, and the final description is not very different. The self-energy effects do not alter 
this basic picture. 

If the calculations were to show that something drastic happened to the individual 
electrons—that they were magnetic or superconducting or semiconducting—obviously the 
collective properties would be altered in a significant fashion. It is important that the initial 
eigenstates are an approximate description of the actual physical system. It is only in this 
circumstance that one finds that the one-particle self-energies have little influence on 
collective properties. 

The one-electron properties are obtained from a calculation of the electron self-energy 
due to electron—electron interactions. This self-energy was discussed earlier, in Sec. 5.1, in the 
discussion of electron correlation. If we had chosen to follow the original discussion of Gell- 
Mann and Brueckner (1957), the correlation energy could be derived without first discussing 
the electron self-energies. The point is made again that the collective properties are not 
significantly dependent on first deriving the one-electron properties. The discussion of 
electron correlation followed Quinn and Ferrell (1958) and first considered the electron self- 
energy. The results of that discussion can be used as the starting point for the present deri- 
vations. The best calculations which have been reported of single-electron properties calculate 
only the screened exchange interaction. 

The electron self-energy from the screened exchange energy is 


1 Bq Tq, idm) 0 
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The integrand contains a vertex function I (q, iq), as discussed in Sec. 5.5. Calculations 
using (5.317) are shown in Fig. 5.21. Results are obtained by a numerical evaluation of the 
multidimensional integral. Figure 5.21(a) shows the real part of the self-energy X „„ as a 
function of k for r, values appropriate to common metals. All of the curves have a similar 
shape. The strong dip in the curve is due to the coupling of the electron to the plasmon. The 
imaginary self-energy Im[2..] has a big jump in magnitude when the electron has enough 
kinetic energy to emit a plasmon. The Kramers—Kronig relation between Re[X] and Im[2] 
guarantees that Re[] has a dip when Im[È] has a jump. 

A similar dip in the self-energy is obtained whenever the electron couples strongly to an 
oscillator. The self-energy of electrons coupled to LO phonons in insulators has the same 
shape, as discussed in Problem 7.7. Of course, there the dip happens at the energy char- 
acteristic of the phonon, while here it occurs at the energy of the plasmon. 
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FIGURE 5.21 (a) The real part of the screened exchange energy &,,(k, €,) as a function of k for values of r, 
appropriate for simple metals. The dip is caused by plasmon emission. Source: Shung et al. (1987). (b) The change in 
energy bandwidth for electrons interacting with electron—electron interactions. The RPA result (G = 0) is compared 
to the G,,(q) of Vashishta and Singwi. The results are quite similar. Source: Mahan and Sernelius (1989). 


The electron self-energy also contributes to the energy width of the occupied band of 
electrons. This change in width can be crudely estimated as the difference between the self- 
energy calculated at k = 0 and at k = kp. 


AW = E, (kr, 0) — Re ,,(0, —Ep) (5.320) 


The energy € = 0 at the Fermi energy, and equals —E, at the bottom of the band. Figure 
5.21(b) compares AW calculated using RPA and the Vashishta—Singwi (VS) form for G,(q) 
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given in Table 5.2. The results are quite similar. The local field factor of G,,(q) has only a 
small influence upon the bandwidth of the electrons. The curve labeled = 1 uses G,(q) in 
&(q, œ) but not in I’. This inconsistent theory is called the “GW” approximation. 


5.8.1. Renormalization Constant Z, 


The renormalization coefficient of the one-electron Green’s function is defined for a 
point (k, € = €,) as [see (3.155)] 


l 


A) = Taek, tha, 


(5.321) 
Usually the retarded self-energy 2... is complex, as is the renormalization coefficient. At the 
particular point, k = kp and € = 0, the self-energy £. is exactly real, since Im(2) vanishes. 
Electrons on the Fermi energy have an infinitely long mean free path, or lifetime. They have 
no unoccupied electron states into which they might scatter. The renormalization coefficient is 
real at this point. At the Fermi energy it is called Zp = Z(kp) 


1 


fr ~ [1 ~ 0X,,(Ap, €)/0E],—9 (5-322) 


The renormalization coefficient is interpreted as the amount of single-particle behavior of the 
particle-like excitation in the electron gas. Since the excitations behave like particles, they are 
called quasiparticles. Generally the spectral function 4(k, €) has a central peak plus some 
smooth background wings. This behavior is illustrated schematically in Fig. 5.22. The 
renormalization coefficient is defined as the area under the central peak of the spectral 
function. If the peak is broad, then the definition is ambiguous, since the limits of the central 
peak are ill-defined. Precisely at the Fermi surface the quasiparticle central peak is a sharp 
delta function, because Im È = 0. Here the renormalization coefficient has a precise meaning: 
the spectral weight under the delta function peak. As discussed in Sec. 3.3, the renormali- 
zation coefficient Zy must be less than or equal to unity since the total area under A(f, £) is 
unity. 

The quantity Zp has an additional interpretation, which can be measured experimentally. 
The momentum distribution of the electron gas is given by [see (3.135)] 


d 
n, = ZU e)np(8) (5.323) 
21 
Actual results for an interacting electron gas have the form illustrated by the solid line in Fig. 


5.23 (Daniel and Vosko, 1960). The dashed line is for a noninteracting Fermi system, which 
has all the particles below k = kp at T = 0. For the interacting Fermi system, even at T = 0, 


A(p,w) 


UJ 


FIGURE 5.22 Spectral function of A. 
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FIGURE 5.23 Momentum density of the homogeneous electron gas for r, = 3.97. Source: Daniel and Vosko (1960) 
(used with permission). 


the momentum distribution is the solid line, with some components for k > kp. The quantity 
n, Should be distinguished from the energy distribution np(£) = 1/(e** + 1), which is a sharp 
step at zero temperature. The renormalization coefficient Zp is the magnitude of the step at 
k = ky in the momentum distribution n,. This assertion is easily shown using the definition of 
n, and the fact that 


A(ky, €) = 2nZ,-d(€) + f (£) (5.324) 


where f (e€) is a smooth function. The step in the momentum distribution in the Fermi gas may 
be measured in actual metals by Compton scattering. 

There have been two major types of calculations with these formulas. The first is the 
RPA, where we use Hedin’s (1965) results. The other is the revised Hubbard model work of 
Rice (1965). The two calculations of Zp are shown in Table 5.4 for different values of r,. Rice 
only reported values of r, < 4. The two calculations are in very good agreement. The two 
models of the dielectric function make the same prediction regarding this quantity. 

Eisenberger et al. (1972) reported Compton scattering experiments in metallic sodium 
and lithium which demonstrated the existence of the high momentum tail in n}. Compton 
scattering is the inelastic scattering of photons by the electrons in the solid. The photons are 
scattered by all the electrons, both in core states and the conduction band. The conduction 
band profiles, which are our present interest, are obtained after subtracting the core scattering. 


TABLE 5.4 Zp as calculated by (a) 
Hedin (1965) and (b) Rice (1965) 


r, RPA“ Hubbard’ 
0 1 1 

l 0.859 0.87 

2 0.768 0.77 

3 0.700 0.70 

4 0.646 0.63 

5 0.602 

6 0.568 
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The best results are obtained for metals with low atomic number, since they have fewer core 
levels to subtract. Eisenberger et al. analyzed their data in terms of the sudden approximation. 
This theory assumes that the scattering rate of photons is proportional to 


d?o x| dp 
dodQ ~ } (2n) "p 


where œ is the energy transferred by the photon and q is the momentum transferred. The 
electrons are scattered from p to p + q, and the average is taken over the initial distributions 
n, of the interacting electron system. When the angular integral is done to eliminate the delta 
function, it is found that the experimental quantity is only a function of the combination of 


variables called Q = (m/q)(E, — ©): 
1 
_ PqVv 
| dvi(o—e, —) = Z Olp — 10) 


Np WW + Ep — Ep+q) (5.325) 


do 

Jodo X J(Q) (5.326) 

1(Q) = | pdpn,, (5.327) 
IQI 


For a noninteracting electron gas, one has that n, = ©(kp — p) and the scattering function 
J(Q) is an inverted parabola: 


J(Q) =4 (ke — O°) O(ke — 101) (5.328) 


Thus J(Q) is shown as the dashed line in Fig. 5.24(a). The solid line shows the type of 
behavior expected for an interacting electron gas. There is a long tail at high values of Q, 
which arises from the similar high tail in n, for values of p > kp. 

The experimental and theoretical results of Pandey and Lam (1973) are shown in Fig. 
5.24(b). The triangular points are the experiment, and they are joined by the solid line. The 
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FIGURE 5.24 (a) The dashed line shows the momentum distribution in Compton scattering from a noninteracting 
electron gas, and the solid line is the prediction for an interacting system. (b) The experimental results in metallic 
sodium (solid points) compared with the theory (open triangles) for this metal. Source: Pandey and Lam (1973) (with 
permission). 
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circular points are the theory. The agreement is obviously excellent. Sodium is the ideal metal 
for testing the distribution n, since its Fermi surface is quite spherical. Other experimentalists 
(Eisenberger et al., 1972) find that the Compton scattering depends only upon Q, which can 
be tested by different choices of q and œ. 


5.8.2. Effective Mass 
The formula for the effective mass was given previously in (3.160). It will be evaluated 


for electrons on the Fermi surface of the metal: 


(=) _ - + dX(k, &)/d€, 
m* 1 — dX(k, €)/de 


3 
= Zs 4 Ou, =| (5.329) 
k=k,,e=0 


k=kp,e=0 OE; 


The results of Rice were obtained by a slightly different formula, which should give similar 
results. His values are shown in Table 5.5. The ratio m/m* is less than unity for values of r, 
less than 2, while it is slightly larger than unity for larger values of r,. The effective masses of 
electrons on the Fermi surface of the metal may be measured quite accurately by many types 
of cyclotron resonance experiments. The effective mass values for real metals are determined 
by three major contributions. The first is the electron—electron interactions, and they make 
only a small contribution to the mass. The second is due to band structure, which is quite 
variable. It is small in sodium but large in lithium and many other metals. The third 
contribution is due to electron-phonon interactions, which is quite sizable. A comparison 
between theory and experiment is deferred until the phonon contribution is calculated in Sec. 
7.4. 

Another application of the effective mass theory is in the low-temperature specific heat, 
or heat capacity. The free-electron theory of metals predicts that the specific heat of a metal, at 
low temperature, is linear in temperature. A free-electron gas of N, particles has the result 


C o 31? 2 m 
o= g NAT om 


For an interacting system, the prediction is that the free-electron mass is replaced by the 
effective mass at the Fermi energy m*. The ratio of the linear term in the specific heat, at low 
temperature, is just the ratio of the effective masses: 

C,  m* 


= 5.331 


(5.330) 


The quantity C,)/C,, is calculated in a similar manner as m/m*. This ratio is also influenced 
by phonon effects. 


TABLE 5.5 Results of Rice 
(1965) for m/m* using the 
Hubbard dielectric function 


r; Hubbard 
0 1 

1 0.96 

2 0.99 

3 1.02 

4 1.06 
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5.8.3. Mean-Free-Path 


The mean-free-path /, is the average distance the electron travels between scattering 
events. Similarly, the lifetime t, is the average time between scattering events and is related 
by J, = vT In the electron gas, most scattering processes are inelastic, and the electron gains 
or loses energy each time it scatters. These quantities are defined in terms of the imaginary 
part of the self-energy: 


— = —2 Im[X(k, &)] (5.332) 


2 
— = ———In[X(k, 5.333 
7 = ~ Fp, MEE 6) (5.333) 
Most calculations have been done with the screened exchange energy in (5.56). The rate of 
scattering is proportional to S(q, œ) = Im[1/s(qg, m)]. The scattering has an energy transfer 
© = Exiq — Sy» Where the initial state has €, > 0 and the final state €.,4 > 0 is empty. A 
simple expression for the mean-free-path is 


1 2, @&q THCa, Sk+q — Sk) 
i> ing ams a MSes 29) eee) 


This formula has formed the basis for most treatments of electron scattering in metals. The 
expression is expected once Im[1/&(q, œ)] is identified as the rate of making excitations (q, œ) 
in the solid. 

For small values of initial energy €, it is necessary to numerically integrate this 
expression. It is a double integral, since one must do both the dq and dv (v = cos 9) inte- 
grations. This calculation has been done by Lundqvist (1969). For more energetic particles, 
the excitation spectrum of Im[1/e(q, @)| is dominated by long-wavelength plasmons. One 
way to evaluate this expression approximately is to assume that the plasmons totally dominate 
the excitation spectrum. For small q use the limiting form given in (5.311)}45.314). 

An analytical result may be derived for the mean-free-path. Insert the delta function for 
Im(1/e) into the integrand and change integration variables to p = k + q: 


1 _@@ % O(t O pf p'dpò(E; — ©, — € Df dv (5.335) 
fr, k Poer 1 k2 +p? —2pkv l 
The factor O(€, — @,) arises from energy conservation & — © = &k4q > 0. The integrals 
may be evaluated: 
1 l k t SE_ — Op 
— O(E, — œ, )l 5. 
: =- (2 2) G opin YA TT v pad (5.336) 


The formula is given in terms of the kinetic energy £, measured from the bottom of the 
conduction band, although plasmons are emitted only when & >, or & > œ, + Er. 
Equation (5.336) is almost identical to an old formula of Bethe (1930) for the rate of energy 
loss of charged particles in solids. 

Figure 5.25 shows a comparison of these theories with the experiments in aluminum. 
The metal aluminum is chosen as a basis of comparison because the best experimental results 
are available of any free-electron metal. The data is due to Tracy (1974). The plasmon 
formula (5.336) only needs the aluminum plasmon energy of iw, = 15.5 eV to determine the 
dashed line. The plasmon theory in Fig. 5.25 is too low by about 30-40%. The electron 
mean-free-path is longer than would be predicted by the simple plasmon formula. The 
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FIGURE 5.25 The mean-free-path of electrons in aluminum as a function of energy above the Fermi surface. Circles 
are data points from Tracy (1974). The dashed line with x points is the RPA calculation of Lundqvist (1969). The 
other dashed line is the plasmon theory which ignores pair production, and its predictions are inaccurate. 


plasmon theory is unreliable at lower energies. Experiments show that the rate of making 
plasmons vanishes below 40 eV (Flodstrom et al., 1977). The other theoretical curve is the 
RPA result, which used the calculations of Lundqvist (1969) for r, = 2, which is close to the 
aluminum value of r, = 2.07. He numerically evaluated (5.334). His curve is in excellent 
agreement with the experimental results. Similar measurements have been done on other 
metals, although few on free-electron metals. See the reviews by Echenique et al. (2000). 

Another experiment is to measure the inelastic scattering of electrons by thin metal foils. 
The electrons are shot through the foil, and their cross section for inelastic scattering is 
measured as a function of angle and energy. The electrons must be fairly energetic to 
penetrate the foil. The experiments show that the electrons mostly lose energy by exciting 
long-wavelength plasmons (Marton et al., 1962). 

First calculate the angular dependence of the scattering when the electron emits one 
long-wavelength plasmon. The process is that the energetic electron, usually in the kilovolt 
range, loses one plasmon of energy and is scattered through an angle 0. The energetic electron 
scatters from k to p. The scattering rate may be obtained from (5.335) by eliminating the 
angular integrals. The result is the differential scattering rate as a function of angle: 


2 


dw eo | 5 O(&; — O — €) (5.337) 


dQ 2n Je” “Ye +p? — 2pk cos) 


The delta function determines the value of p. For energetic electrons, this momentum is 


approximately 
O 
p=, |k — 2mo, © k — > (5.338) 
k 
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The typical scattering angle is very small, and it is possible to use the small-angle approx- 
imation for cos@ = 1 — 67/2. The denominator can be simplified to 


K + p* — 2pk cos(0) © (k — p? + kp? ~ k (0 + 6°) (5.339) 
2 _ “p (5.340) 
2E; l 


and the cross section has the simple form derived by Ritchie (1957): 
dw e ©, | 


dQ 2nv, 0? +60? 


(5.341) 
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FIGURE 5.26 Energy loss spectrum at 6 = 0 for electrons going through aluminum films. The relative peak heights 


are shown in (b) and are well fit by a Poisson distribution shown by the solid line. Source: Marton et al. (1962) (used 
with permission). 
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The scattering distribution is Lorentzian. The angular width 0, is typically a fraction of a 
degree when g, is a kilovolt. The angular distribution is sharply peaked in the forward 
direction. This behavior is found experimentally (Sueoka, 1965). 

Another aspect of electron energy loss through metal foils is shown in Fig. 5.26, which is 
the data of Marton et al. (1962) for aluminum. The peak on the left of Fig. 5.26(a) is from the 
electrons which go through the foil with no energy loss. The next peak is from those which 
excite one plasmon, and the nth is from electrons which excite n — 1 plasmons. The angular 
dependence of these peaks was measured and found to be Lorentzian. Another feature of 
these peaks is shown in Fig. 5.26(b), which is the peak intensity, or the area under each curve. 
The background is subtracted according to the dashed line in Fig. 5.26(a). The peak inten- 
sities follow a Poisson distribution (solid line), so that the intensity of the nth peak is given by 


P,=e°—, x = — (5.342) 


The parameter « is the thickness of the film d divided by the mean-free-path for plasmon 
emission. The Poisson distribution is a characteristic of the independent boson model of Sec. 
4.3. A Poisson distribution occurs whenever the bosons, in this case plasmons, are emitted 
independently of previous emissions. This model is reasonable for these fast electrons. The 
electron energy was 20 keV. An electron of this energy does not lose a significant fraction of 
its energy when it emits a plasmon of 15 eV. Similarly, its angle does not change appreciably, 
since the scattering is mostly in the forward direction. The electron trajectory is largely 
unchanged by the plasmon emissions. All emission events have the same likelihood and are 
independent of previous emissions. It makes sense that the independent boson model should 
be applicable here. Of course the zero-temperature version of the theory is used, since even at 
room temperatures, the number of plasmons thermally present 1/(ef% — 1) is negligible. 


Problems 


1. Consider the ground state of the ferromagnetic electron gas. The particles are plane waves, but all 
spins point in the same direction. 


(a) Find the ground state energy of the system in the Hartree-Fock approximation. 

(b) Compare this with the paramagnetic state (equal numbers of up and down spins) calculated in 
the same approximation. Are there values of r, for which the ferromagnetic state is lower in 
energy? 


2. In the Wigner lattice the electron feels a harmonic potential V(r) = —3/r, + r? /r}. Find the zero- 
point energy from oscillations in this well by assuming that each electron moves independently of the 
others. 


3. The correlation energy may also be evaluated by Rayleigh—Schrédinger perturbation theory (Gell- 
Mann and Brueckner, 1957). The second-order ground state energy is 


IV |i)? 
ED =p WIT 34 
g G E-E, (5.343) 


Obtain the ground state energy terms which contribute in second order, and write them down in terms of 
wave vector summations. 
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4. Calculate the equilibrium value of 7,. for metallic hydrogen using NP correlation. Also calculate the 
ground state energy per electron. 


5. Calculate the cohesive energy of metallic helium assuming it is a uniform electron gas, with « 
particles spaced on a crystal lattice. Compare your result to atomic helium. 


6. Consider putting a single point charge Q into a homogeneous electron gas. Show that to order Q? 
the energy required to do this is 


feo 
2=5| (2ny @ Le) 


Estimate this result for the Thomas—Fermi model. 


| + O(Q?) (5.344) 


7. Use (5.117) to estimate S(q) at large q in the RPA. At large q, one has that Im(1/£) ~ —e3. 
8. Use (5.117) to find the behavior of S(q) as q — 0. 


9. Set 1 — g(r) = A exp(—rkr). 


(a) Determine A by the normalization condition (5.71). 

(b) Determine S(q). 

(c) Determine G(q). Compare this on a piece of graph paper with Hubbard and Singwi—-Sölander 
for r, = 3. 


10. Calculate the compressibility of the electron gas from the ground state energy: 


(a) in the Hartree-Fock approximation. 
(b) including both HF and Wigner correlation energy. 


Plot both results on a piece of graph paper as K,/K vs. r, for metallic densities. 


11. Start from the Singwi—Sjdlander expression (5.198) for G,,(q). Take the limit that q —> œœ and find 
their prediction for this limit. 


12. Calculate S(q) in the Hartree-Fock approximation. First show that the starting point is S(q) = 
1 — (2/N) do MMk+q: 


13. Calculate y(r,) in the Hartree-Fock approximation using the results of Problem 12. Verify that the 
correlation energy (5.249) is zero with this choice of y(r,). 


14. Use the form of the dielectric function (5.187) to show that the plasma dispersion relation is 


hq? 4 
a, (q) = @,(0) + (©) + O(q') (5.345) 
m 
_ 3 hk?. 5 dTF 2 


where a is the parameter which enters into the compressibility (5.270). 


15. Prove the ground state energy theorem (5.242) starting from (3.297) using just the electron- 
electron interaction. 


374 Chap.5 e Homogeneous Electron Gas 


16. In the Vashishta—Singwi theory, the constant a in the compressibility is no longer equal to y as 
given by (5.244). Find the new relationship to y. 


17. Does the RPA satisfy the sum rule (5.307)? Show by explicitly doing the integrations. 
18. Calculate ¢,(q, œ) in RPA for a classical gas which obeys Maxwell—Boltzmann statistics. 


19. Derive the sum rule for the third moment for a Hamiltonian of particles in a potential field V(r). 
Do this by evaluating the expectation of the triple commutator with H: 


H= > EPC) Cog + $ PV (q) (5.347) 
o q 
(o°) = (IA, [H, E, AN, pa) (5.348) 


20. Prove that the step in n; at k = kp is Zp. 


21. The screened exchange energy (5.317) may be calculated exactly in the Thomas—Fermi approx- 
imation. Do this, and use your answer to obtain analytical expressions for 


(a) the self-energy at p = 0. 
(b) the self-energy at p = kp. 
(c) the effective mass at the Fermi energy. 


Plot all three results as a function of r, on a piece of graph paper. 


22. Calculate e(q, œ) in RPA for the two-dimensional electron gas at zero temperature. Show that the 
plasmon dispersion can be given by an analytical expression. 


23. Calculate the average kinetic and exchange energies per electron for a two-dimensional electron 
gas with Coulomb interactions e?/r. Define r, by mnyajr? = 1, and your answer should be 
E; = 1/r2 — 1.20/r,.... Find the analytical form of the coefficient of the exchange energy (it is not 
6/5). For a review of correlation calculations, see Jonson (1976). 


24. In two dimensions show that the exchange energy at the Fermi surface is £, = —2e°kp /n. Use this 
result to calculate the compressibility in the Hartree-Fock approximation. At what density r, does the 
compressibility go negative in two dimensions? 


25. Draw the connected Feynman diagrams for the electron self-energy from electron—electron 
interactions that contain two or three internal Coulomb lines. Show that all terms can be written as 
vI /e in (5.317). Which diagrams contribute to I’ and which to £? 


Chapter 6 
Strong Correlations 


The previous chapter discussed metals in which correlation plays a minor role in the 
dynamics of the electron. The properties of the electron gas could be calculated simply and 
accurately. The present chapter discusses other metallic systems in which correlation plays a 
much larger role in the dynamics of the electron. These topics come under the title of strong 
correlation. Strong correlation causes itinerant magnetism. 

The theory of magnetism in metals continues to be one of the challenging subjects of 
modern theoretical physics. Magnetic phenomena are important, and are observed in a wide 
variety of materials. The equations are difficult to solve. A variety of theoretical techniques 
are applied to magnetic phenomena, such as Monte Carlo, renormalization group, and Green’s 
functions. Only the latter technique is discussed here. 

Strong correlation is usually treated by solving the Anderson model or the Hubbard 
model. They are discussed here, along with the Kondo model (1964) which was the historical 
precedent. Most Green’s function analyses have focused on these three model Hamiltonians. 
Each of these three models is not a single mathematical problem. Instead, each has a number 
of variations. For example, the original Kondo model describes a system of conduction 
electrons interacting with a single localized spin. The local spin can have any value of angular 
momentum S, and the conduction electrons can be treated in one, two, or three dimensions. 
Recent variations include the dense or periodic Kondo problem, where there are N local spins 
arranged on a lattice. Another feature of all three models is that they can be solved exactly in 
one dimension, using the Bethe Ansatz, but not in higher dimension. One-dimensional results 
are not a useful guide to collective effects in higher dimension, since there are neither phase 
transitions nor long-range order in one dimension at nonzero temperatures. 


6.1. KONDO MODEL 


For many years experimentalists noticed that magnetic impurities in nonmagnetic metals 
caused anomalous behavior in the low-temperature resistivity p(T). Magnetic impurities are 
those with a moment caused by partially filled d- or f-electron shells. An example is 
manganese impurities in copper. The electron scattering from a nonmagnetic impurity makes 
a contribution to the resistivity that is independent of temperature. However, a magnetic 
impurity causes a resistance minimum at a nonzero temperature. Kondo explained this 
behavior as due to spin-flip scattering between the conduction electrons and the localized 
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spin. The resistance minimum is now called the Kondo effect. The Kondo model is described 
by the Hamiltonian 


J A Rk z 
H = Y EkCkoCko — nae” CECE, Cpr — Cih Cp) SP 


+ CL, Cp St PA 57] (6.1) 


The first term represents the kinetic energy of the conduction electrons. If the local spin is 
from a d-shell, the conduction band is usually formed from atomic orbitals that have s- and p- 
symmetry. If the local spin is from an f-orbital, then the conduction band could be from s, p, 
or d electrons. 

The remaining terms represent the scattering from the local spin at the site R,. The 
number of local spins is N,, and the concentration is c = N,;/N, where N is the number of 
atoms in the solid. Usually it is assumed that c is quite small. The electron self-energy is 
calculated to order O(c). This approximation is equivalent to assuming that the conduction 
electron scatters from one impurity at a time, or that the impurities are widely separated. 

The interaction term can be written compactly as s- S, where the small s is the spin of the 
conduction electron, and the large S is for the localized spin. The first scattering term in (6.1) 
is the s®S@ component. For s® we have plus one (Cy Cy) when the spin of the conduction 
electron is up, and minus one (— cÌ c Cp) when it is down. The other two terms in the 
scattering interaction represent spin-flip processes, where there is a mutual spin flip between 
the conduction and local spin. The conduction electrons are assumed to be electrons with spin 
one-half. The local spin can have any value permitted by quantum mechanics. Here it is taken 
to have spin S with magnetic quantum number m. Both S and m are either integers or half 
integers, where m has the range of values —S < m < S. 


S®\m) = m|m) 
SO m) = [S(S + 1) — mim + 1]? |m + 1) (6.2) 
Sm) = [S(S + 1) — mm — 1)]'"|m — 1) 


The operators S} raise or lower the magnetic quantum number m. The interaction term in 
(6.1) can be written compactly as 


Va = = > exp[iR; (k — p)]S ag SCL, Cop (6.3) 


kpaB 


The indices « and B denote the spin of the electron after and before the scattering event. The 
vector © is a Pauli spin matrix for different x, y, and z components. If the meaning of the 
terms in (6.3) is unclear, just consult (6.1) for the actual formula. 

The constant J is called the “exchange energy.” Values with J > 0 are called “ferro- 
magnetic,” since the local spin tends to line up parallel with the conduction band spins. Values 
with J < 0 are called “antiferromagnetic,” since the local spin tends to line up antiparallel 
with the conduction band spins. This nomenclature is dependent upon writing the Hamilto- 
nian with a negative sign before the interaction term. Most magnetic impurities have J < 0. 


6.1.1. High-Temperature Scattering 


The self-energy is calculated for an electron scattering from the impurity. The local spin 
and the conduction band spins are assumed to have equal probability for any value of m. 
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These assumptions are valid at high temperature. Whether the temperature is high or low 
depends upon a reference temperature called the Kondo temperature Tę, which will be 
defined below. 

The interaction is (6.3). The average over impurity positions is taken following the 
prescription in Sec. 4.1.4. The first term is found in first-order perturbation theory. The 
average of V,, over impurity positions forces k = p. The first-order self-energy is a constant 
which is independent of wave vector or energy: 


dy = —CJ (O y ° S) (6.4) 


The notation o,,, means to take the z component. It is usually assumed that the system is 
nonmagnetic. Then the average value of o% and S® are both zero, since there are equal 
numbers of states occupied with spin up as with spin down. Then the above self-energy 
expression is zero. 

Next consider the second-order scattering. The n = 2 term in the S-matrix expansion for 
the Green’s function is 


Gk, 1) = f an | dr, (TC y(t) altra!) 


=- [ an | do © E oo (TSP) 
2 ss'uu' kı kp) p> 


X (Tye (t) Ch, g(t) Cp, (TI Che u(t) Cp wT) Cg (0))8(K, — kz — pı + Pz) (6.5) 


The last bracket contains three raising and three lowering operators for conduction electrons. 
One can pair them up using Wick’s theorem. The correlation function of the local spin is not 
treated as easily. Local spins are neither fermions nor bosons. The thermodynamic average is 
obtained by averaging over the states |m) with different magnetic quantum number. If there is 
no external magnetic field, then each state of different m is equally likely, which is assumed 
here. 

The major problem with evaluating the correlation function of the local spin is that they 
do not obey a Wick’s theorem. Here it is no problem, since there are only two operators, 
which must be paired. However, the lack of Wick’s theorem is a major algebraic difficulty in 
higher orders of perturbation theory. The S“) operators must be paired: each S) must be 
paired with a SÌ. Each bracket can have an arbitrary number of S® operators, and they are 
not paired. The correlation function for the local spin has three possible combinations which 
are evaluated using (6.2): 


(TS (t )SO(t)) = S(S+1)—m* —m sen(T,; — T>) (6.6) 
(TS (1, )SO(t)) = S(S + 1) — m? + m sgn(t, — t2) (6.7) 
(TS(t,)S(t)) = m? (6.8) 


The only t dependence is in the order of the operators. At this point it is useful to average over 
the values of m. Then the term linear in m vanishes, and the first two of the above expressions 
become identical. 

Assume that the initial spin « of the conduction electron is +. If the scattering does not 
flip the spin, the intermediate and final states are also «. Then the local spin correlation 
function has the form (S@S®). The other possibility is that the spin flips so that the inter- 
mediate state has the opposite spin to x. Then one must use one of the two combinations with 
S) and SO). The Green’s functions for the conduction electron are the same for the two 
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cases of spin-flip or no spin-flip. Just add these two cases, which causes the m? terms to 
cancel. The second-order self-energy for the Kondo model is 


2- (ik,) = cS(S + I ge (6.9) 
This self-energy expression has the same form as ordinary impurity scattering in the second 
Born approximation. The spin enters only as an effective interaction of S(S + 1)J*. Early 
workers in the field stopped here, and concluded that spin-flip scattering provided no inter- 
esting behavior. This conclusion is wrong, since the unusual effects are found in the higher 
orders of perturbation theory. 

The self-energy of the conduction electron is now calculated in the third Born 
approximation, which repeats Kondo’s original calculation. The anomalous behavior which 
gives rise to the Kondo effect becomes evident. The n = 3 term in the Green’s function 
expansion is 


cJ? B B B 
G°)(k, t) = -S >| dt, l d | dt (k, t — T,)G(p, Ti — T2) 
N* pa Jo 0 0 
x Gq, T2 — 13)G(K, T3)L(T1, T2, T3) (6.10) 
L(t), T2, 13) = 2 oV oL oR TS SO) (3) (6.11) 
Lv 


The self-energy must be the same for spin-up and for spin-down electrons, so set « = + for 
spin up. The (s, s’) can be either of the four combinations (+, +), (+, —), (—, +), (—, —). For 
x = + then each of these four combinations of (s, s’) has a unique choice for (vptA). Since the 
electron Green’s functions do not depend upon the spin label s, the summation over (s, s’) is 
included in L: 


L(t, T2, 73) = (TSP SPSO) + (TS (4,)SO (SM (43) + (TSO (4) SMP (1) S (13)) 
+ (ISOS SP) (6.12) 


No time variables are written in the first term on the right since this expression is independent 
of the t variables. This term equals m? for any t ordering. The other terms depend upon the t 
ordering. The factor of S® is always to the right of S. This choice follows from the ansatz 
that « = +. Spin-flip scattering must lower the spin of the conduction electron, while raising 
the spin of the local electron. The local raising operator is S which must act at the earlier 
time. The factors o,,, are not written for each term. They are always plus except for one case. 
When the factor S® immediately follows in time (t) the operator S“) then a minus sign is 
inserted. The minus sign arises because the factor SP raises the local spin while flipping the 
conduction spin from up to down. The o® operates on the down conduction spin and gives 
minus one. 

The above expression must be evaluated for all six arrangements of t ordering. The 
ordering T] > Ta > T3 gives 


L = m +[S(S +1) — m? —m][m+m—-(m+)] (6.13) 
L= m|S(S+1)+1]— S(S +1) (6.14) 
The terms are written in the same order they occur in (6.12). The minus sign before the factor 


of (m + 1) is due to the sign reversal mentioned above. The term proportional to m vanishes 
when we average over m. After averaging, the above expression equals —S(S + 1). 
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Next consider the t-ordering of t3 > tT, > t}, which gives 


L=m? +[S(S +1) —m + m][m—m+(m+1) (6.15) 
L = m{S(S + 1) +1] + SWS +1) (6.16) 


After averaging over m this term gives S(S + 1). This result is similar to the one above, except 
that the sign has changed. One can work out the other four cases. The three cyclic 
arrangements T; > T} > T3, T2 > T3 > T], and T3 > T} > T, each have L = —S(S + I). The 
other three arrangements 1; > T) > Ti, T2 > T] > T}, and 1, >T} >T, each have 
L = S(S + 1). The three t integrals in (6.10) must be broken up into the six separate regions 
of different t ordering. Three of them are multiplied by S(S + 1) while the other three are 
multiplied by —S(S + 1). 

The notation (123) means tT, > T, > 73. In this notation the following result is obtained 
for L after averaging over values of m 


L = —S(S + 1)[(123) + (231) + (312) — (321) — (213) — (132)] 
1 = (123) + (231) + (312) + (321) + (213) + (132) 


The second line above expresses the idea that the summation of all possible time orderings is 
just the unity operator. This result can be used to rewrite L as 


L = —2S(S + 1)[(123) + (231) + (312) — # (6.17) 


The amount of work has been cut in half, since now only three time orderings need to be 
evaluated. 

The third-order Green’s function of frequency is evaluated by multiplying Eq. (6.10) by 
exp(ik,) and integrating dt between (0, B). The integrals are most easily evaluated in reverse 
order: 


| J? p} k,- 
GEk, ik,) = —cS(S + Na | dr,e 0) Gk, t3) 
B B 
Ne | dre O-VG(p, t -7| dre) G(q, t, — ty) 
0 0 


BO 
x l dret D-n) Gk, tT — T, L(t, T, T3) (6.18) 
0 


In Eq. (6.17) the last term in brackets is L, It has no limits on the order of the t integrals. Then 
all of the above integrals have their usual limits of (0, B), and the above expression is 

cJ? l 

—S(S + )— MAM, ik, f Y ———_____—~ (6.19) 
N? 2 (ik, — &)(ik, — €q) 


This expression is a contribution to the third-order Green’s function. If one eliminates the 
prefactor of 9 (k, ik,)’ then the remaining expression is a contribution to the third-order 
self-energy. It is just the third term in the 7-matrix equation generated by iterating (4.49). This 
contribution to the self-energy is rather dull. 

The Kondo effect comes from the terms in L which restrict the order of the t integrals. 
The restrictions are on T4, Tə and t3. There is no restriction on the dt integral. It can be done 
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immediately, and yields (k, ik,,). The first interesting integral is (123), whose limits can be 
written as 


(123) > f dt; [ dt i dt, (6.20) 


T3 T2 


Doing the integral in (6.18) with these limits gives the following result. Drop all of the 
prefactors and summations, and list only the results of the three t integrals: 


[1 — ni — aa) [1 — n(g)Iln(&) — np) 
(ik, — & Mik, — & (ik, — 84) Uy — &pV(Ex — Ep Ep — €q) 
[1 — nil - na) 


—— 6.21 
+ Gk, Eer — E)E, — E) (6.21) 


The expression has a multitude of fermion occupation factors n(k), etc. Similar expressions 
are obtained for the other two t orderings (231) and (312): 


(123) = 


B B T2 
(231) > | ars | ar, | dt, 
0 


0 T3 


ap- | [th ning) = no) -on -na 
(ik, 7 E, )(E, 7 E4) Ep — Ek Eq — Ex 
MBM = nl), mnpIL1 = ng) = mgt = (BI = nD) gap 
ik, — €} ik, — Ep — Ek — Ep + Eg l 
B T3 T3 
(312) = | dt | dt I dt 
(312) = H-rik- aA O oln 
(ik, — E, (ik, 7 E, (Ep g Ex) (ik, 7 E, (ik, _ E, )(E, ~ Eq) 
EMPL — na) = ng = n(b = ro) 629 


(ik, — &p)(iky — Ek — Ep + &g)(Ep — Eq) 
An interesting thing happens when we add these three expressions. Most of the terms cancel: 
(123) + (231) + G12) = 
l l n(p) n(q) 
-— |e a ___|_ 6.24) 
(ik, — €,)° E ~ E, (ik, ~ E,) (ik, ~ E, NE, _ €p) (ik, — E, ME, 7 E4) 


A further simplification occurs by noticing that the last two terms are identical after inter- 
changing the variables of integration p and q. The result for the third-order self-energy is 


oq (ik, — Eq) | (ikp — €p) €p — €q 


X (ik,) = —S(S + ye yl la An(p) | (6.25) 


The final expression is rather simple. There are two terms. The first term in the brackets is the 
rather dull term, which is the third-order Born scattering from a simple potential. The second 
term in the brackets was first derived by Kondo. He used it to explain the temperature 
dependence of the resistivity, which is now called the Kondo effect. Today it is known that 
similar anomalous terms occur in all higher orders of perturbation theory. The modern theory 
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of the Kondo effect employs terms from all orders of perturbation theory. Nevertheless, 
Kondo’s simple arguments will be presented. 

The self-energy expression has a factor of the fermion occupation number n(p). These 
factors do not occur in scattering from a simple potential. They occur here because of the 
spin-flip scattering. Interesting physics happens whenever they occur, such as in the BCS 
theory of superconductivity or in the edge singularities in X-ray absorption in metals. 

The second term in the above brackets produces a term in the resistivity that goes as 
In(T), where T is the temperature. A quick and sloppy derivation is presented of this 
behavior. The conduction band density of states is defined as g(s). It is assumed to be a 
smooth function of energy £, which extends from W < £ < B. Summations over wave vector 
are changed to integrations over energy using the prescription 


B 
yy) 3 l deg(e)f(e) (6.26) 
NS w 


A typical alloy system that shows the Kondo effect is manganese impurities in copper. The 
density of states g(£) of conduction electrons in metallic copper has a large energy depen- 
dence because of the occupied d-bands. However, the conduction electrons that are involved 
in the Kondo effect have energies within several k,;T of the Fermi energy. On that small 
energy scale, the density of states of copper and most metals is smooth and featureless. The 
custom is to call the density of states g(e) a constant g(0), where £ = 0 signifies the Fermi 
energy. 


For the Kondo effect, the interesting third-order self-energy is 


B B 
Lap(ik,) = —45(S + 1)oJ? l de, | de, EED") (6.27) 
-W ik W €] — E> 


n lJ- 


The ultimate goal is to calculate the resistivity. Then the quantity of interest is the scattering 
rate, which is related to the imaginary part of the self-energy. In this expression let 
ik, > œ + in, where n is infinitesimal, and then take the imaginary part. The first energy 
denominator gets replaced by —n5(w — €,). The first integral gives —mg(@). In order to 
evaluate the second integral, it is assumed that the temperature is low, so the fermion 
occupation factor can be replaced by the step function n(e,) ~ ©(—«,). The most important 
term is the logarithmic singularity which comes from this Fermi cutoff 


| a SOLON f ap sE 
-W W 


0) 
X In =] . 
WO — £ W — E> glo) r (6.28) 


Since the density of states g(w) is a smooth function of energy œ, this quantity is usually 


replaced by g(0). The standard expression for the third-order self-energy in the Kondo effect 
1S 


Im[Z3,(o)] = —40S(S + 1)cJ?.g(0) in| (6.29) 
The real part of 23, has an uninteresting œ dependence near œ = 0. 
This result is combined with the larger term from the second order of perturbation 


theory: 


Im[X(o)] = —nS(S + Ies*g(0)| + 4Je(0) in| | ' | (6.30) 
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In calculating the conductivity, the energy œ of the electron is averaged over a region near the 
Fermi surface of width kgT. This replaces In|m/W| by In(kgT/W). Kondo derived a 
temperature-dependent resistivity of the form 


p(T) = p(0)[1 + 4Jg(0) In(kgT/W)] + bT? (6.31) 


The first term p(T = 0) is the temperature-independent part of the impurity scattering. The 
second term with In(7) is the Kondo effect. The last term ~ T? is the low-temperature 
contribution from electron-phonon scattering. Usually the experiments are done in metals or 
alloys where phonon drag does not eliminate the T° law. 

The factor of In(kgT/W) is negative at low temperature. If J < 0 then the Kondo term is 
positive. At very low temperatures, the resistivity increases in value, since this term becomes 
large. This expression predicts a minimum in the resistivity, which is observed in Kondo 
systems. Define Tin as the temperature at the minimum resistance. By taking dp/dT = 0 one 
finds that (Torin)? ~c, where c is the concentration of impurities (Note p(0) œ c). This 
relationship is in agreement with experiments on Kondo alloys. A typical measurement of 
p(T) is shown in Fig. 6.1 for the system of La,_,Ce,B,. The resistance minimum is evident. 
The resistance increases at low temperature, and finally saturates. The reason for the 
saturation is given in the following section. 

The theory presented in this section is a high-temperature theory. A key assumption in 
the derivation is that the local spin has equal energy whether it is up or down. At low 
temperature, this assumption ceases to be valid. Each local spin becomes locked into a 
collective state with the conduction band spins. This collective state has a binding energy, 
which must be overcome during a spin-flip. The spin-flips become frozen out at low 
temperature, and the Kondo effect saturates. This process is described in the next section. 
Kondo’s formula (6.31) is only valid at temperatures high above the formation of the 
collective state. 

Our derivation of the Kondo resistance was sloppy for the following reasons. In eval- 
uating the integral over de, the Fermi factor n(€,) was replaced by a step function ©(—¢)). 
This approximation is only valid at zero temperature. Later in the derivation, we averaged 
In|o/W| over the distribution of electrons while assuming the temperature is nonzero. A 
better derivation would have evaluated the integral over de, at nonzero temperature, which 
broadens the Fermi distribution a few degrees in energy. Of course, this realism makes the 
integral harder to evaluate. 
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FIGURE 6.1 Total resistivity vs T for LaBę and La,_,Ce,B;. The Kondo temperature is Tg = 1.05 K. Source: 
Samwer and Winzer (1976) (used with permission). 
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6.1.2. Low-Temperature State 


At low temperature the local spin forms a collective state with the conduction band 
spins. The collective state is formed from a linear combination of conduction band states, and 
is a many-particle entity. Earlier chapters have often discussed the role of screening charge in 
reducing the long range Coulomb interaction in conducting systems. The screening charge is 
not from a single electron in the conduction band, but usually is formed by a linear combi- 
nation of conduction states. The present collective state can be viewed, in a similar way, as a 
screening effect. However, now it is screening the spin of the local state, rather than its charge. 
If the impurity is charged there could coexist the screening clouds for spin and charge. 

For J < 0 the state is a singlet. In order to keep the discussion simple, let us assume that 
the coupling is antiferromagnetic (J < 0) and that the local spin has S = L. The up and down 
states of the local spin are denoted as a and B. Yosida (1966) first considered the collective 
state. Its wave function can be written in two possible ways: 


1 

Vou = ay aylaC], — BCL, IF) (6.32) 
1 

Vor = JN ox, by lOCyy + BC IF) (6.33) 

IF) = IM pier, Cop Ch, 10) (6.34) 


The symbol |F} denotes the filled Fermi sea of electrons. The state Y,a forms the collective 
state by adding an electron above the Fermi sea. It has the usual form for a spin singlet, as an 
antisymmetric combination of up and down states of the local and conduction spins. The 
coefficient a, is a parameter that needs to be determined. 

The second choice of collective wave function Wo, needs further explanation. Here one is 
forming the collective state by taking linear combinations of hole states, where the hole is a 
missing electron from the filled Fermi sea. The easiest way to understand this wave function is 
to examine one of its terms, which include the relevent factor from the Fermi sea: 


[eCity + BCK ICK, Ck, 10) = [Ck — BCY, 110) (6.35) 


The right-hand expression is obviously a spin singlet, composed of an antisymmetric 
combination of down and up spin states. The relative plus sign on the left becomes a minus 
sign on the right. The two states Wo, and Wo, are orthogonal. Each describes a different 
collective state, and the system will choose the one with the lowest energy. One is a ground 
state, and one is an excited state. 

The first term in (6.1) is called Hp and the second term is Vq. If there is no local spin, the 
ground state energy for this model Hamiltonian is 


Hy|F) = EglF) (6.36) 
Eg = 2 > Ek (6.37) 
Ik|<kp 


Now add one local spin at the point R, = 0. In this calculation, one does not average over the 
spatial positions of the impurity, and one does not set p = k in the interaction term. Only one 
impurity site is being considered. 
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The goal is to calculate the additional energy ðE < 0 gained by the system because of 
the formation of the collective spin state. It is analogous to the energy found for charge 
screening using Fumi’s theorem in Sec. 4.1. The eigenvalue equation is 


{Ho + Vsa} Voa b = (Eg + SEa, b)Vo,a,b (6.38) 


This equation is not strictly valid since neither Yo, nor Wo, is an exact eigenstate of the 
Hamiltonian. In order to understand this assertion, consider the effect of one term in Vya 
acting upon one term in Wo, 


Ci CuSO E aah IF) (6.39) 
p.q Ik|>kr 


The spin-lowering operator S changes « to B. If q = k there is a term 


2 a| 2, BC ma (6.40) 
|k|>Ap Ipl>kr 


which is recognized as being similar to one of the other terms in Yoa. However, there are also 
terms in (6.39) in which k, p, and q are all different wave vectors. These terms describe a state 
with one hole and two excitations above the Fermi surface. An electron-hole pair has been 
excited in addition to the original excitation. This state is perfectly valid, since it is generated 
by the interaction term in the Hamiltonian. Our failure to include it in the ansatz eigenstate 
Wo, is simply an approximation. A more accurate ansatz would include these additional terms. 
There are an infinite number of them, which correspond to the multiple excitation of electron— 
hole pairs. The more pairs that are included, the more accurate is the wave function. 

The best way to understand the ansatz wave function in (6.32) is to consider it a 
variational calculation on the ground state. The coefficients a, and b, are the variational 
parameters. The ground state energy is calculated, using Dirac notation, with j = a or b, as 


(WojlH|Wo,) 
(WojlWo;) 


After evaluating the right-hand side of this expression, say for j = a, one finds the minimum 
value of òE, by taking the functional derivative 5(6E,,)/da,. There results an equation for a, 
that can be solved. These steps produce the minimum ground state energy for the ansatz 
(6.32). 

The method of solution is equivalent to the variational procedure, but is algebraically 
simpler. Equation (6.38) is evaluated. The multipair states are neglected, and only project out 
the terms with one conduction band excitation. For example, when solving for Wo, define 
(F|aC,, as the Hermitian conjugate of aC} |F) and evaluate the expression 


(F\aCp, {Ho + Vsa — Eg — Ea} Voa = 9 (6.42) 


As an example, the easy terms are done first: 


{Hy — Eg — òEa}Yoa == X aplEc + & — Eg — 8EgllaCy, — BCL, IF) 


l 
JN Ik| >Kp 


(F|Ck, {Ho — Eg — Ea Woa = (6.43) 


l 
JN Ay LE, _ OF, | 
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A similar evaluation must be done for V,;. One term was previously evaluated in (6.39). 
Another term in V,iWo, iS 


E SCi, Car — Ch Cay) >> ayaC} IF) (6.44) 
p,q >kp 


The operator S® acts upon « and gives «/2. The electron operators give several contributions. 
For p = q the part in parentheses gives the number of electrons with spin up minus the 
number with spin down. The state Ci k|) has one more electron with spin down, so the 
summation gives — 1. There is also a term where q = k. These two contributions are 


—} 5 2 ayaCy. lF) -3| > «| > iP) (6.45) 
* kis% F |k|>Ap [pl >A 

Other terms contain an electron—hole pair, and these contributions are being ignored. In the 

above expression, the first term can be neglected since it is of order O(1/N) compared to the 

second. The second has a double summation, while the first has only a single summation. By 

employing these kinds of approximations, there results 


VaVoa = = al >, a > ACON — BCI JF) (6.46) 
Ik|>kr Ipl>kr 
This result is combined with (6.43) to produce the equation for a,: 
3J 1 
dk = > a (6.47) 


ON Ek — &, — ôE, Ip|>kr P 


This expression is then summed over k. These summations are changed to integrals over the 
particle energy. The eigenvalue equation is 


l =— 


B 
>| de 2) (6.48) 
0 


€ — oF, 


The solution to the wave function Y, follows the same steps used to obtain Yoa. The form of 
the wave function in (6.35) has the same form as (6.32), so that all steps in the derivation are 
similar. There are two minor differences. First, since |k| < kp, the integral over energy is over 
negative energy states. Second, the result of HjWo, has the factor of (Eg — €x); the minus sign 
preceding €, comes from the fact that Wo, has an electron missing from the Fermi sea. This 
changes the sign of £ in the integral. So the eigenvalue equation for Wo, has the form 


3J 1l 
b, = =——— b 6.49 
k — 2N e, + ôE, >. p (6.49) 


A g(é) 
l = — E€ 
2 Jw E€+òôE, 


(6.50) 


The two eigenvalue equations (6.48) and (6.50) are very similar. The major difference is that 
one has an integral over the empty states in the conduction band, while the other has an 
integral over the occupied states. Since dE, , < 0 neither energy denominator is singular. 
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These two expressions will be evaluated for a simple model of the conduction band. The 
density of states g(€) equals a constant g. Then the two integrals give 


1 = GJg/2) In W 8h, (6.51) 
1 = (3/g/2)In\- a (6.52) 
which have the solution 
of, = — LAI (6.53) 
òE, = — EAR (6.54) 
2 
In(A) = 3Je (6.55) 


The largest value of 8E, , is determined by whether B or W is largest. If the band is less than 
half full, then W < B, 8E, > 5E,, and Wo, is the ground state. If the band is more than half 
full, then W > B and Wo, is the ground state. If the band is exactly half full, the states are 
degenerate. These results only apply to the case of a constant density of states. Other 
examples are assigned in the problems. 

Remember it is assumed that J < 0. In fact, there is no solution for singlet states when 
J > 0. The present approximations only make sense when E < (B, W) so that the logarithm 
terms in (6.52) are negative. Then the equation has a solution only for J < 0. For anti- 
ferromagnetic coupling, the factor of A = exp(2/3Jg) is generally much less than unity. This 
expression also has the feature that the coupling constant J appears as an inverse power in an 
exponent. It would be impossible, or at least very difficult, to derive an expression for OF 
from perturbation theory. 

This mathematical form, of an inverse power in an exponent, will be encountered in 
Chap. 10 in the BCS theory of superconductivity. The energy gap and the formula for T, have 
a similar factor of exp[—1/N-Vol. 

The Kondo temperature is defined by the expression 


kpT x = W exp] - (6.56) 


1 
2g(0)|7 | 
This expression is similar to the result for the change in ground state energy. The main 
difference is the factor of f in the exponent has become L, The reason for this change is given 
below. A general definition is that the change in ground state energy defines an energy scale 
that also defines a characteristic temperature. The Kondo temperature is also very close to the 
temperature at which the Kondo effect becomes important in the resistivity. The self-energy 
correction given by the second term in (6.30) becomes equal to the first at a temperature of 
T ~ Tx. It is not the same temperature as the resistance minimum, since the minimum 
depends upon the concentration of impurities. The Kondo temperature defines an energy scale 
for a single impurity. Values of Tẹ for actual alloys span a large range in temperature, from 
107° to 10° K. 

So far the derivation suggests that the collective state has a similar behavior regardless of 
the sign of J. However, further work has shown that the collective state is not formed for 
ferromagnetic coupling (J > 0). If further terms are evaluated in the perturbation expansion, 
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they destabilize the collective state for J > 0. For J < 0, further terms in the perturbation 
theory continue to predict a collective state. The Kondo effect does depend upon the sign of J, 
and exists only for J < 0. 


6.1.3. Kondo Temperature 


The two prior sections discussed the high- and low-temperature regimes of the Kondo 
effect. The most interesting behavior occurs for temperatures near the Kondo temperature Ty, 
which is defined in (6.56). A short and somewhat qualitative discussion of this temperature 
region is presented here. This region is actually very well understood because the exact 
solution to the Kondo problem is now known. See the solutions by Andrei et al. (1983) and 
Wiegmann (1981). 

The high-temperature scattering theory has been carried out to much higher orders of 
perturbation theory. Abrikosov (1965), and Silverstein and Duke (1967) have suggested that 
the perturbation expansion for the retarded Green’s function can be represented by the simple 
function 


cJ 


Fra) = RO oW OSS + D 


(6.57) 
The function calculated in Sec. 6.1.1 was the imaginary part of the self-energy. The imaginary 
part of the above expression is 


tcJ*2(w)S(S + 1) 


mÈ elo] = ————_— > ee 
[1 — 2Jg(w) Injo/WI]" + [goss + 1)] 


(6.58) 
If the denominator is expanded in a power series in Jg(w), the first term is the perturbation 
expression in (6.30). Another feature of this expression is that the denominator has a reso- 
nance when the energy œ ~ W exp[1/2/2(0)|, which makes sense when J < 0. There is no 
resonance for J > 0. This observation provides further evidence that there is no collective 
behavior for J > 0. 

A factor of 2./g(w) can be taken out of the denominator in (6.57). Using the definition of 
the Kondo temperature in (6.56) allows the real part of the denominator to have a simple 
form: 


ret \O) = 29(@) Injo/kgTx| + imS(S + 1)/2 (6.59) 
miZ ,(o)) = ctS(S + 1)/42(@) 


In” |@/kpTx| + [SCS + 1)/2} 


The interesting feature of this expression is that the œ dependence of the self-energy scales 
only with the Kondo temperature. One should be able to describe all of the self-energy effects 
by this single parameter. It also has a resonance behavior. The square of the logarithm has its 
smallest value of zero at ®, = kgTg, and the denominator increases in magnitude for values 
of œ different from @,;. 


6.1.4. Kondo Resonance 


Wilkins (1982) observed that many of the properties associated with the Kondo effect 
can be described by a simple expression. The system acts as if the density of states has a 
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resonance at the Fermi surface for each impurity with a local moment. Let the concentration 
of impurities be c; = N,/N. A simple expression which well describes the density of states is 
Ci 


Tol = 2o) + 5g) (6.60) 


grlo) = g(@) + 
where y = 1.6kg7,. The change dg(@) integrates to one electron for each impurity with a 
local spin. 
For example, to calculate the change in the specific heat 5C due to the change dg(), 
first find the change dU in the internal energy 


dC = sy = c; f(T/Tx) 
oT (6.61) 
ðU = | doonp(a)8e(o) 


Figure 6.2 shows how the function f(T /T,) (dashed line) compares to the exact result 
(solid line). The two results are quite similar. Note that the temperature scale spans several 
decades. The specific heat per magnetic impurity is a function only of T/Tg. At high 
temperature the specific heat per impurity goes as f ~ Tg/T. At low temperatures it goes as 
f ~T/Tx, which can be large if the Kondo temperature is low. Wilkins observes that the 
entire effect of the many-body interaction is to add a Kondo Resonance at the Fermi surface, 
such as the term 6g(@). Numerical calculations of the Kondo resonance show that it is usually 
asymmetric in frequency and is not centered precisely at œ = 0. 

The resonance behavior also affects the resistivity. Assume that the Kondo resonance 
affects the quasiparticle self-energy I’ = Im[2]. Instead of the behavior predicted in (6.59) use 
the resonance form in (6.60). The temperature-dependent conductivity o(T) is given in terms 
of the lifetime t(T) 


2 
o — wet (6.62) 
m 
°° dn;(@)| 1 
— EN | 6 
T B dol To | aT (6.63) 
ry 
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Pa 
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FIGURE 6.2 Specific heat per impurity for the Kondo model. The solid line is the numerical solution using the 
renormalization group. The dashed line is the calculation using the Lorentzian resonance with unit weight and width 
equal to 1.6Tg. Adapted from Wilkins (1982) (used with permission). 
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FIGURE 6.3 Resistivity vs temperature in the Kondo model. 


where Og = noe?t,)/m is the conductivity from nonresonant impurity scattering for T >> Tr. 
The parameter r comes from the ratio of resonant to nonresonant impurity scattering. The 
frequency integral in (6.63) can be done numerically. The result is given in Fig. 6.3, for 
r = 10, which shows the resistivity p(T) = 1/o(T). At high temperature (T >> Tp) one finds 
= Py. At low temperature (T < Tx) one finds that p = p,(1 +r). This behavior of the 
resistivity is similar to the experimental dependence shown in Fig. 6.1. At high temperature 
the resonance effects from spin-flip scattering are smeared out by thermal broadening. As the 
temperature is lowered to the vicinity of Tg, the thermal smearing decreases, and the Kondo 
resonance becomes more important in scattering the electron. The resistivity increases. At still 
lower temperatures, the collective state begins to form, and the local spin becomes harder to 
flip. Then the resistance saturates, and no longer increases with decreasing temperature. 


6.2. SINGLE-SITE ANDERSON MODEL 


The Anderson model (1961) was introduced in Sec. 1.4. It is another model for a system 
of conduction electrons that interact with a local spin. Equation (1.354) gives the transfor- 
mation of Schrieffer and Wolff (1966), which shows that the model has some terms that are 
similar to those of the Kondo model. Early workers thought that the two models made very 
similar predictions. Now it is known that the Anderson model has a greater variety of 
behavior. It has the more interesting physics. 

The Kondo model treats the local spin as a separate entity. The Anderson model treats 
the local spin as just another electron. It can undergo exchange and other processes with the 
conduction electrons. The Anderson model is more realistic. 

The Anderson model is written here with a slightly altered notation. Most applications of 
the model consider the localized state to have spin and orbital degeneracy. It could be a d 
electron of 10 states or an f electron of 14 states. Usually the spin—orbit interaction splits these 
states into 6 and 4, and into 8 and 6, respectively. The crystal field may cause further 
splittings. Let N; denote the actual degeneracy of the local level: it may be an even number 
such as 2, 4, 6, 8, 10, or 14. One of the theoretical developments is a perturbation expansion, 
where the expansion parameter is 1/N;. Obviously this expansion works better for larger 
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values of N;. Many recent applications of the Anderson model have been for heavy fermion 

systems, where the local orbital is one or several electrons in f orbitals. That is the origin of 

the subscript f on the factor of N. The symbol N without the subscript is the number of 

atoms sites in the solid. For reviews of heavy fermions see Stewart (1984) or Lee et al. (1986). 
The Hamiltonian is written as 


Hy => &,CiyCry + & nm, + UY ayn, (6.66) 
v u u>v 
ny = fafu (6.67) 
1 
V = — Y Vn ific, +c! 6.68 
Tn vv k iw Al ( ) 


where V is the hybridization interaction between the band electrons (k) and the localized 
electrons /,. The onsite Coulomb repulsion is U and gp is the eigenvalue for a single f- 
electron. The summations over (p, v) run between 1 and Ny. This index denotes a combi- 
nation of orbital and spin states. Only the band states (k, v) with the same index interact with 
the local orbitals. The wave vector k can be represented by the magnitude k plus various spin 
and angular momentum states. The latter are represented by the index v. An important point is 
that of all the many symmetries available to the conduction band states, the only ones which 
couple to the local state are those values of v between 1 and N;. The summations over k will 
eventually be changed to an integral over the energy £. The density of states is g(€) and the 
matrix element as a function of energy is written as V, = V(é). 

How does the local spin get flipped in the Anderson model? Actually it does not flip. 
Because of the hybridization term, a local spin that is down can become a conduction state 
with spin down, and then wander away. Later, a different conduction electron with spin up can 
come and reside in the local orbital. This process gives the appearance of the local spin having 
flipped from down to up, while conduction states have flipped from up to down. Now one can 
see the important role of the electron—electron interaction U. The spin-flip process has two 
steps: (1) the departure of the old local electron with spin down, and (2) the arrival of the new 
local electron with spin up. If U = 0 these two steps are totally independent and can occcur in 
any order. There is no appearance of flipping, since down and up electrons come and go 
independently. However, once U >> kpT, then the two steps become correlated. It is ener- 
getically unfavorable to have an up and down spin electrons both on the local orbital. The two 
steps tend to become sequential: one leaves before the other comes. Then the local spin 
appears to flip. 

The important parameters of the Anderson model are the bandwidths W, the conduction 
band density of states g(e), the local site energy €,, and the on-site Coulomb interaction U. 
There is also an energy parameter associated with the hybridization term which is defined as 
I (e): 


r(e) = wÈ V28(E — £4) (6.69) 


= nV (e g(£) (6.70) 


This function is assumed to be a constant, independent of energy. Its value for electrons in 
f-orbitals is about 0.1 eV. 

Most theories have been calculated for the case that the ground state has only one 
electron in the f-orbital. This corresponds to the case of cerium triply ionized (Ce**). Typical 
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values are U ~6eV, T <0.1 eV, and ep ~ —0.5eV (Herbst and Wilkins, 1987). The zero of 
energy is taken to be the chemical potential. Since the spin—orbit splitting is large, one uses 
only the lower state which has N, = 6. The energy of one local electron is ep, while the 
energy of two is 2e; + U. In cerium one has that £; < 0 but 2e; + U > 0. Then the cerium 
ion prefers to have just one electron in the local orbital. A related case is for triply ionized 
ytterbium which has all f-orbitals filled except one. The ground state has one f-hole. The 
theory is almost the same, but one uses N; = 8 for the upper multiplet of the spin-orbit split 
f-state. When the local orbital is a d electron the values of I are larger. 


6.2.1. No Hybridization 


The term in the Hamiltonian involving U comes from the on-site Coulomb interaction 
between two electrons on the same atom. Usually this term is far too large to be treated by 
perturbation theory. It must be included in Ho, which is the noninteracting Hamiltonian. 
However, this term makes problems in doing the Green’s function expansion. Including this 
term invalidates Wick’s theorem for local electrons. The Green’s function expansion becomes 
awkward. In evaluating correlation functions, each time ordering must be evaluated sepa- 
rately. This increases the effort, as was evident in the prior section on the Kondo effect. Here 
we use the noncrossing approximation (Bickers, 1987). 

There are several other approaches to solving the Anderson model besides the non- 
crossing approximation. The method of Gunnarsson and Schoénhammer (1983, 1987) was 
described in our second edition. Grewe and Keiter (1981) developed an alternate formalism 
for expanding the S-matrix for the local operators. They found formal but rigorous results for 
the local self-energies. Another method of finding the same results was developed by 
Coleman (1984) using the slave boson idea of Barnes (1976, 1977). One introduces a ficti- 
tious boson which is created every time there is a change in the occupancy of the local level. 
One can control the f occupancy by controlling the chemical potential of the slave bosons. A 
similar method was used earlier by Abrikosov (1965) in solving the Kondo problem. Another 
method is renormalization group (Hewson, 1993). 

The first step is to evaluate the Hamiltonian H, while setting V = 0. This step is easy, 
even while including the Hubbard interaction U. The eigenvalues of the electrons, in the 
absence of hybridization, depend upon the number n of electrons in the f-orbital. This integer 
ranges in value 0 <n < Ny. The eigenvalues are 


n(n — 1) 
2 


E, = ney + U (6.71) 


where the number of pair interactions multiplying U is n(n — 1)/2. The eigenvalue expression 
(6.71) is parabolic in the level occupancy n. An example is shown in Fig. 6.4. Since gp is 
usually negative, the minimum energy is found near to n ~ 5 — &,/U. Denote as L the value 
of n which has the lowest value of E,. The f-configuration with the lowest energy has an 
occupancy of L electrons. Note that L is increased by changing the value of ep towards larger 
negative values, while keeping U fixed. 

Mixed valence occurs when the number of f-electrons can vary between L and L + 1. In 
Fig. 6.4 it varies between the states with n = 3 and n = 4, which are at the same low energy. 
The hybridization interaction causes mixing whenever |E; — E+] is a small energy. By 
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FIGURE 6.4 Eigenvalues E, vs n for the Anderson model with no hybridization. Values assumed in the plot were: 
& = —18.0eV and U = 6.0 eV. 


“small energy” is meant values in the range of 0.1—-1.0 eV. The standard nomenclature is to 
define the two excitation energies A. out of the ground state as 


A_ = Er — E; = -[e; + (L — 1)U] (6.72) 
A, = bray —E, = & +LU (6.73) 
U=A,+A_ (6.74) 


The definition of the Hubbard U as being equal to A_ + A, follows by summing the prior 
two equations. Equation (6.74) can also be used as a phenomenological definition. Using 
band structure methods to calculate A_ and A,, one can add them to obtain a theoretical value 
for U. 

Values of A, have been calculated for the rare-earth metals. The results of Herbst and 
Wilkins (1987) are shown in Table 6.1. Generally they find one value which is small 
(A ~ 1.0 eV) while the other is larger so that U ~ 6—8 eV. It is assumed that similar values 
apply to the rare-earths ions when they are in metallic compounds. Note that samarium and 
ytterbium have small values of A, and they have a mixed valence in many compounds. 
Cerium also has a mixed valence in many compounds, although its value for A_ does not 
seem to be small. An important point is that these values do not have to be on the order of 
thermal energies, but in fact are generally much larger: a sizable fraction of an electron volt. 
Even with these large values, mixed valent behavior occurs often. 

Next calculate the partition function Z which is related to the thermodynamic potential 
Z = exp(—BQ). This quantity is needed for all of the thermodynamic averaging. Define the 
degeneracy factor Z, 


(NA N 
n= ( 7) ~ nN, — n) (6.75) 


This factor is obtained by asking for the number of different ways that n identical particles can 
be distributed among N; identical states. 

The ground state degeneracy of a rare-earth ion are given according to Hund’s rules. 
They are shown in Table 6.1. The Anderson model ignores the splittings of the f-orbital due to 
spin-orbit and other factors, and gives a degeneracy which is far too large. The Anderson 
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Table 6.1 Ground staste degeneracies (2J + 1) for valence (3+) of rare 
earths according to Hund’s rules: A, = E,,,, — E, in eV. The second entry 
for Eu has valence 2+. From Herbst and Wilkins (1987, used with 


permission) 

nf S L J 2J+1 A_ A, 
Ce l 1 3 5 6 1.8 3.2 
Pr 2 l 5 4 9 3.8 1.9 
Nd 3 3 6 2 10 5.2 1.5 
Pm 4 2 6 4 9 5.5 1.6 
Sm 5 3 6 3 6 5.4 0.3 
Eu 6 3 3 0 1 7.9 
Eu 7 5 0 z 8 1.9 8.6 
Gd 7 5 0 I 8 8.5 3.7 
Tb 8 3 3 6 13 3.2 2.2 
Dy 9 3 5 i 16 4.9 1.5 
Ho 10 2 6 8 17 5.8 1.4 
Er 11 3 6 48 16 5.9 1.8 
Tm 12 l 5 6 13 5.5 1.1 
Yb 13 5 3 7 8 7.4 0.9 


model is solved as given in (6.68), which has a degeneracy given by Z, rather than Hund’s 
rules. The partition function is 


Z=)°Z,e °& (6.76) 


When U = 0 this summation simplifies to Z = [1 + e% ^. 
The Green’s function for the f-states in the absence of the hybridization term is 


B 
GIM,) = — >| dren (Tf Ofi) (6.77) 


u JO 


The expression for 4; can be written exactly for the noninteracting states. The trace is taken 
over the states |n). The notation N; denotes the number of states for a single f-electron. The 
degeneracy of having n electrons in the f-orbital is Z, as defined above. The qt integral is 


1 Bo o 
G, = -> LZ, t | dre (nef eon + 1)(n + 1 fiin) 


1 B | 
=— 7È Ze P" (n| faln + 1)(n + A ftln) | de Ont En Ens) 
p,” 0 


e BE,, + e~ BEn+i 


1 
==>Z l 1| ft In) ————— 
z% ntal faln + 1)(n + LS I") 50, +E, ~ Ena (6.78) 


The matrix elements >a (nl flat ljin+f tjn} are easily evaluated since they are just the 
number of empty states into which one can add an electron into the f-orbital. If there are 
already n, then the number of empty spaces is N; — n. In the term with exp(—BE,,,,) change 
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summation variables n + 1 —> n. Since Z,(Ny — n) = (n+ 1)Z,,, these steps produce the 
final expression 


G (i@,) a, -BE Aon á (6.79) 
io,)=s ey . 
f É Z n=0 " 1D yy + En < Ep41 IOn + En-1 T E, 

A,(@) = 2 Im[G,(o)] (6.80) 


Qn A _ 
A,(@) = s L Ze P {(N; — n)8(@ + E, — En y1) +n8(@+E,_;—E,)} (6.81) 


The spectral function A;(@) is obtained by the analytic continuation iw,, > œ + in and then 
taking twice the imaginary part. The factors of N, — n in the first term, and n in the second, 
arise from the degeneracy of the state. In the state |n) there are n different states that can be 
destroyed by the lowering operator f, when summing over all u. This accounts for the factor 
of n. Similarly, the factor N; — n is the number of empty states into which the raising operator 
can add an f-electron. 

Let E; be the lowest eigenvalue (n = L) and assume that L is neither 0 nor N;. The most 
important terms in the summation are those for n = L — 1, L, L + 1. Factor Z; exp(—BE;) 
from each term. Energy differences E; — Ez}; are expressed as A,. These approximations 
give for the partition function 


Zx Z; _ e PEE + Z,e7 PEt + Zye pe (6.82) 
Zx% ZZ eP (6.83) 
Z Z 
Z=1+ 7 e PELE) 4 OLH o` BEL+1 E) (6.84) 
L L 
L N; —L 
F] ag BAL J —BA,. 6.85 
NI-L TIFI" (6-85) 


Other terms can be ignored since they have exponential factors exp(—BAE) with very large 
values of AF. These exponential factors are vanishing small. The same approximation for the 
spectral function gives six terms which have four different energy differences 


Ay(@) © a 150 +A_)(1 + e7P*-) + (N; — L)8(@ — A, )(1 + eP) 


pa. L-1) 
+e PA-_~"__ 8 + E; — Ez) 
Ny +1-L 
ga, (Ny — LYN, — £ - 1) 
+e Ps 0 Ery — Ey 49) (6.86) 


The approximate spectra has four peaks. Two are below the chemical potential and two above. 
We follow custom and ignore the last two terms in A;(@) and include only the peaks at 
© = —A_,A,. For the case that L = 1 then the third term is missing anyway since there is no 
state L — 2. Usually it is a good approximation to neglect terms of O(e~®4+), and in this case 
the approximate spectral function is 


A,(@) © 2n[L6(@ + A_) + (Ny — L)8(@ — A,)] (6.87) 


d 
EZIO =N; (6.88) 
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This spectral function has only two peaks: one at œ = A, and the other one at œ = —A_ 
They have relative strength of N; — L and L, respectively. There are only two peaks, since the 
Green’s function gives the probability for two processes: (1) adding an electron, or (2) 
removing an electron. In a well-defined ground state, these two processes each have a peak, 
and there are two peaks. Without the Hubbard energy U both peaks would occur at the same 
energy: there would be a single peak with weight N;. The spectral function integrates to N,, 
rather than to one, because the present definition included a summation over all of the state v, 
of which there are Ny. 

The solution to the Anderson model is rather easy without hybridization. The next step is 
to include this important process. 


6.2.2. With Hybridization 


An expression for the partition function is derived including the interaction V. Define the 
eigenstates of Hy by |n) and those of H = H) + V by |æ). 


Hy|n) = E,ln) (6.89) 
Hla) = ela) (6.90) 


The states |n} have n electrons in the f orbital. The nature of the states |x) is to be determined. 
In principle, they exist, and the derivation proceeds using that assumption. They form a 
complete set, so an insertion of }_, |x)(«| is equivalent to the identity operator. The derivation 
below proceeds by inserting this operator whenever the total Hamiltonian H appears in any 
expression. Using this process to evaluate the partition function gives: 


Z = © Z,(nje P* |n) (6.91) 
Z = © Z, (njaje”® (aln) (6.92) 
Z = Ð Z| (nja) eP (6.93) 


Next introduce the spectral function A,(w) for the many-electron state n. In prior usage the 
symbol A,(@) may have meant the spectral function of a single electron in the state n. Here 
the meaning is different. The symbol |n) denotes a state of n electrons in the f orbital, and 
A_,(@) is its spectral function. It is the spectral function for a system of n electrons. It is not the 
spectral function of a single electron, unless n = 1. 


A, (@) = 27(n|d(o@ — H)|n) (6.94) 
= 2n | (n|a)|?5(w — £) (6.95) 


where the complete set of states `, |«)(a| is inserted in order to evaluate the delta function 
containing H. This formula for the spectral function can be used to evaluate our above 
formula for the partition function. From now on, the expression ) |, \(n|x)|? will be evaluated 
as an integral over the spectral function 


Z= vz, [Letea (6.96) 
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The partition function can be expressed as a summation over the noninteracting states n. For 
each value of n, integrate over œ with the factor in the integrand of exp(—Bw)A,(@). The 
method of calculating A,(@) is described below. 


6.2.3. Self-Energy of Electrons 


The hybridization interaction V in Eq. (6.68) describes the processes whereby a 
conduction electron can hop on or off the f shell of the local atom. This hopping preserves the 
spin and orbital quantum numbers of the electron. At first it appears to have nothing to do 
with spin fluctuations. As explained in the introduction to this section, the fluctuations in 
occupancy can change the spin of the local electron. The local spin fluctuations are due to the 
hopping process. 

This process leads to several important new effects. The first is a new energy parameter 
defined as I" in Eq. (6.69). It is taken to be a constant. The second effect is that the resonance 
lines at œ = —A_, A, become broadened by an amount proportional to I. The third new 
effect is that the Kondo resonance peak appears in A;(@) near to zero energy. It is an 
important feature in the Anderson model, which is due to spin fluctuations. 

The prior subsection discussed the energy levels associated with having n electrons in 
the f state. Introduce a Green’s function for this state 


l 


On) = E,- E0) 


(6.97) 
Most of the Green’s functions in this book are for a single electron. There are also two-particle 
correlation functions, such as density—density or current—current correlation functions. The 
present Green’s function is neither of these (unless n = 1). It is a Green’s function for a state 
ofn electrons in the f orbital, where values of n have the range 0 < n < Ny. Similarly, assume 
there is a self-energy Ł,„ for this n-particle Green’s function, which can be evaluated using 
Dyson’s equation. 

The hybridization interaction causes the self-energy 2,,(@). This function is calculated 
using the noncrossing approximation (Bickers, et al., 1987). The result will be derived by an 
intuitive argument. A rigorous definition is provided below. First consider the result for the 
self-energy in the second order of perturbation theory 


x, (0) = nS, (@) + (N; — nS È (0) (6.98) 
(0) _ Vp Nk 
Sp+1 (©) = SEs u h PESEN (6.99) 
§@) — =E Ve =n (6.100) 


o — (E,_; + &) + in 


There are two terms. The first term (f'C,) in V adds an electron to the f levels, which 
changes n to n+ 1, while destroying a band electron in the state k. This process is propor- 
tional to the occupation number n, of the band electrons, and gives the term S © (a). Again 
the factor of N; — n is the number of empty states into which the electron can be inserted into 
the f orbital with n electrons. The second term (ci f)in V takes an electron away from the f 
level, which changes n to n — 1, while adding an electron to the band. This process is 
Pr ortional to the probability (1 — n,) that the band state is unoccupied. It gives the term 
Ss © (o). These two processes provide the two terms in the above expression for È. 
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Note that the arguments of S® are Green’s functions G, without the self-energies. 
l 
SO (o) = N D Ven,GO (0 + &) (6.101) 
k 


The “non-crossing approximation” (NCA) is to evaluate these Green’s functions while 
including these self-energies. Put the self-energies in the denominators 


Z, (0) = n5,- (0) + (N; — mS, 41(0) (6.102) 

1 
Sny (0) = 57 Verena (O + &) (6.103) 
S0) = FEA -m00 8) (6.104) 


The above nonlinear equations comprise a self-consistent definition for the self-energies £. 
The values of n range over a limited number of possibilities (0 < n < N;), so only a few 
functions of energy need to be determined. They can be solved by iteration on the computer. 
The temperature enters into the expressions through the Fermi—Dirac occupation numbers n,. 
The Kondo resonance is quite temperature dependent, and must be calculated anew for each 
temperature. 

The results are illustrated by a simple example. Assume a density of states with a 
Lorentzian shape. 


(c) = 2D 
& = 224 pe 


and place the chemical potential at e = 0. This choice has the advantage of not introducing 
band edges. The parameter D is an effective band width and [ = rg(0) V(0)? = 2nV(0)"/D. 
In S’(@), change variables e => —e which now makes Sp, = S,,. The main integral to evaluate 
is 


(6.105) 


I de 
S, (0) = aliro” Ote + €) (6.106) 


Next discuss the derivation of Eq. (6.102). Again define |n) as the exact eigenstate of Ho 
while |æ) is the exact eigenstate of H = Ho + V. The definition of the Green’s function is 


l 


G,(@) = (n| sa i” (6.107) 
E \(oeln) |? 
=L E 4m (6.108) 


Again a summation over the states |x)(x| was inserted in order to evaluate the energy 
denominator with the Hamiltonian H. This result could also be obtained using the Lehmann 
representation. 

The self-energy XZ, can be obtained by a similar argument. One starts with the state |n) 
and evaluates the self-energy in the second order of perturbation theory. The interaction V is 
regarded as small, so an expression valid to O( V?) is considered adequate. The general 
expression is 


a lEn 
nO) = oe 


(6.109) 
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The intermediate states |/) are taken to be the set |x). The perturbation V has two terms: one 
(f'C,) adds an electron to the f orbital while removing an electron from the conduction band. 
The other term (ail f) removes an electron from the f orbital while adding one to the 
conduction band. These two processes generate the two terms: 


ofml(al fine A- rolera] 
x,(@) = D Vie ecco Te) + ey) E + &) (6.110) 
o fallna + iin + 1 ftin)P (A npn — 1)(n — uem] 
we rk | © — (E, — &) t @ — (E£) (6.111) 


l 
= N 2 Vil(Ny — n)n,G n+1 (O + Ez) + nil 7 nN; )Gy_ 1(@ — Ex )} 


The summation over intermediate states (Z| includes the exact states («| and the eigenstates of 
the band electrons. They are summed oxpiiciny in the second line. A factor of |n + 1) (n 1| 
was also inserted, to use the fact that (n + 1| f'|n) and (n — 1| f]n} are well defined. This 
produces in (6.111) an expression which is obviously the Green’s function. The last equation, 
in the above set, is the expression for the self-energy in the non-crossing approximation. 

A similar technique can be used to derive the spectral function A,(@). Start the derivation 
with the general definition of the one-electron Green’s function: 


1 Bo 
Flik) = -3 A | dre" (nje P-E feH ft n) (6.112) 
n 0 


This Green’s function is the usual one, similar to those in other chapters, where the definition 
entails adding or removing a single electron from the system. It is the function which is of 
physical interest, since it directly relates to the density of states and other quantities. On the 
other hand, the Green’s function G,,(ik,,) is that of the n-particle state. It is the result of adding 
n electrons to the system. It generally is hard to measure. In the present case G, is useful since 
it is used to calculate G,. 

The expression in brackets can be written exactly by inserting summations over exact 
states |x) and |À), as well as noninteracting states |n + 1) and |n’ + 1). 


D (njaj P (an f In! + 1) (n! + Ae (Aln + 1) (n 1 ft In) (6.113) 


aAn’ 


The usual approximation is to restrict the summation to the terms with n’ = n. Gunnarsson 
and Schénhammer (1983) consider the terms with n’ =n + 1 and find they make a small 
contribution. They are a type of vertex correction. Setting n’ = n simplifies the expression. 
After performing the integral over t 


e Pe, + e Ber 


G (ik,) = =F Z,(N; — 210 25S 
(thy) = 7% ZulNy -AE (cin) PCA D 


The matrix elements such as |(«|n)|? and |(Ajn + 1)|? are expressed in terms of the spectral 
functions A, and A,,, which produces the result 


e Be + e Be’ 


l 
8 lUby) = 7 ZAN; -n | Ay) EO 
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Taking the imaginary part of the retarded function gives the delta function 5(@ + € — £’) 
which can be used to eliminate one of the energy integrals. After rearranging a bit, 


LY [dE g | 
AKO) = 5 Zh | xe An(E) nd, iE — 0) + (Ne = MA iE+)} (6.114) 


The final equation is the result (6.114) for the spectral function A,(@). Note that the integral 
for A,(@) is a convolution of the spectral functions for adjacent values of n. In practice, the 
only important terms in the summation are those with n = L —1,Z,2+1. Again ignoring 
terms of O(e~®4+) then this expression simplifies to 


1 (dE 
ao) © a | Fe MA (EMLA, (E — 0) + (Ny — Didi (E +0) 
Z, on (6.115) 


~~ 


dE _ 
Z © [Ze BE 4 (E) 


A numerical example of these equations is presented. For cerium, one typically chooses 
values such as e; = —0.5 eV, U =6.0eV, and I’ = 0.1 eV. This gives Ey = 0, Ey = —0.5 
eV, E, = 5.0eV, and the other levels are unimportant. The width parameter in the Lorentzian 
density of states is D = 1.0 eV. The energy differences are A_ = 0.5 eV and A, = 5.5 eV. The 
ground state has L = 1 and the mixed valence is between L = 1 and L = 0. Figure 6.5 shows 
these spectral functions evaluated at T = 300K. The spectral function for the ground state 
A,(E) is almost a pure Lorentzian with small width. This small width is provided by the small 
value of Im È, due to its interactions with the neigboring state with L + 1. From Eq. (6.102) 


—2 Im[z,()] = xe V2((1 — n)A (0 — £4) (6.116) 
+ (N; — 1)n,Ao(@ + €)] (6.117) 


dé 
= I | ———,, n,-(£)[Ap(@ + £) + (Ny — 1)A(O e 
| pre Oet) + WY ~ o+) 
The integration variable was changed in the first term £ — —e which is only possible when 
u = 0 and with a symmetric density of states. Both terms in the integral contribute about 
equally: Although E, is further away in energy, and less effective, the factor of N, — 1 
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FIGURE 6.5 Spectral functions for cerium at T = 300K: (a) Ap, (b) A,. Energy units are eV. 
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multipling A, increases the importance of this term. The 1/N; expansion is based upon the 
premise that terms of O(N;) are much larger than those of O(1) and the latter can be 
neglected. Our experience is that this assumption is not accurate numerically. 

The spectral function for Aj has a main peak at Ey which is nearly Lorentzian. The 
broadening is caused by I. However, there is also a narrow peak at the energy E, of the 
ground state. It is due to the self energy Im[X | defined by 4;: the latter function has a peak 
there. The peak at the ground state energy, in Ap, is evidence that this state is fluctuating with 
the ground state. The spectral function of A, does not show a peak at the energy E}, since £, 
is too far away. 

The spectral function A;(w) is found by the integral in Eq. (6.114). It is shown in Fig. 
6.6. Since both A, and A, have sharp peaks at the same energy E}, their convolution has a 
sharp peak near to zero energy. The peak near to zero energy is the Kondo resonance. The 
Kondo resonance has a narrow width. This narrow width is close in value to kgTg, where Tx 
is the Kondo temperature. The Kondo phenomena is caused by this sharp peak, and the width 
of the peak—its characteristic energy—defines the energy scale which in turn defines the 
temperature scale. The Kondo temperature for this case is about Tg = 300K. The height of 
the Kondo resonance decreases rapidly with increasing temperature. 

The Kondo resonance was first derived in the prior section, where it arose from the spin- 
flip scattering of a conduction electron from a local impurity spin. The present Kondo 
resonance is the same peak, calculated from the Anderson model rather than from the Kondo 
model. In both cases, it arises from the interaction between the conduction and local electrons. 
In the Anderson model, the Hubbard U appears not to play a major role. It only enters into the 
energy denominators. However, this term is the major reason for the Kondo resonance. 
Setting U = 0 causes the energy denominators to become equal. The Fermi factors cancel in 
the numerator of the self-energy expression, and one has a simple function with dull features. 
For U = 0 the Kondo resonance is not present. The energy U for pair interactions is the major 
cause of the Kondo resonance. The Kondo resonance is the major macroscopic manifestation 
of the spin-flip scattering. This resonance explains the unusual behavior in the heat capacity 
and other measurements. 
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FIGURE 6.6 Kondo resonance as a function of energy for cerium. Note the expanded energy scale. 
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As an example, calculate the heat capacity for a system with a concentration c of such 
Kondo resonances. This calculation uses the formula 


(6.118) 


C(T) = | dEA,(E)E (- dnr (E ) 


dE 


The heat capacity rises linearly with temperature at small temperatures. The formula for the 
free-electron gas C ~ m*k3T/E;, is used to estimate the effective mass m*. One finds a value 
for this system of m* = xm,, where m, is the mass of the electron and x is very large. The 
system appears to have a very large effective mass. This result is the origin of the name 
“heavy-fermion”. The fermions are not that heavy. There is just a peak in the density of states 
due to the spin fluctuations, which explains the large value of the heat capacity at low 
temperature. 

The low-temperature properties were not calculated correctly. If there is a concentration c 
of such cerium atoms, then at low temperature the system develops a band of f-states, as the f- 
electron hops coherently from cerium to cerium. The f-electrons develop their own Fermi 
surface. This subject is rather interesting, and is reviewed by Zwicknagl (1992). The band of 
f-states vanishes at higher temperature, as hopping becomes less probable. This behavior is an 
example of electronic polaron model of Holstein (see Liu, 1988). 

These results can be used to calculate the electrical resistivity p(T) and the Seebeck 
coefficient S(T). The usual assumption is that the scattering rate for the conduction band 
electrons—their inverse lifetime—is proportional to a constant term from impurity scattering, 
a term from phonon scattering, and a term from scattering from the local f states. The first two 
terms are lumped into t)(7). The latter term is taken to be proportional to their concentration 
c and the spectral function A, along with another constant u 


l l 

_ To(T) 
(0) = 1 + cutyA-(o) (6.120) 
o(T) = eo, | aer) (- ae ) (6.121) 
ToS = 0, l dEE:(E)g(E)( — ae ’) (6.122) 


where ©, is a constant. The factor of g(£) is the density of states. The factor of the square of 
the Fermi velocity is part of o,;. This expression was derived by Mahan (1997). 

The Kondo peak has an energy scale very different to that of the scattering due to 
phonons or impurities, and the latter contributions are treated as constants. Note that the effect 
of the Kondo resonance is to cut a hole in the energy spectrum of t(@) since the factor of 
A,(@) is found in the denominator. Figure 6.7 shows the calculated results of S(T) for this 
model for cerium. Calculations of this kind were first done by Bickers et al. (1987). They 
show there is nearly a universal curve for the different cases when plotting the results on an 
axis of the logarithm of T/Tx. Different choices of the model parameters (es, U, T, Ny) give 
different values for the Kondo temperature 7,. The curve for the Seebeck has a universal 
shape and maximum value (~100 np V/K) when plotted on a log scale. Note that S is the ratio 
of two integrals: If the concentration c of f levels is large, then the factor of c cancels between 
the two integrals. This cancellation is the reasons the curve for S is universal. 
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FIGURE 6.7 Seebeck coefficient S(T) for a model appropriate to cerium. 


Figure 6.8 shows the Seebeck coefficient for CePd, and YbAI, plotted as a function of 
temperature using data from Jaccard and Sierro (1982). Both the peak values, and the general 
shape agree with the model calculations. These Seebeck coefficients are large for metals. 
Most metals have values in the range of 2-5 uV/K. The Seebeck coefficient is large whenever 
g(E)t(E) has sharp, asymmetric, structure within 2kgT of the chemical potential (Mahan and 
Sofo, 1996). In this case, the sharp structure is the Kondo resonance. 

The Anderson model describes spin fluctuations. There are also charge fluctuations as 
the electron hops on and off of the local orbital. The present theory neglects them. They could 
be described by including a term in the Anderson Hamiltonian of the form 

Veg = 2 fi V 2 Viw Cs C's (6.123) 


There is a Coulomb interaction between the local f electrons and band electrons. The latter 
scatter from k’ to k. The matrix element, and the scattering, depends upon how many local 
electrons are present. One can call the ion valence Z and the sum over local electrons controls 
the value of Z. When an electrons hops, say, off the orbital, then this sum changes. The theory 
of this charge fluctuation encounters singularities identical to the Anderson renormalization in 
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FIGURE 6.8 Absolute Seebeck coefficient as a function of temperature for CePd, and YbAI. The results for cerium 
are positive and for ytterbium are negative. 
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the x-ray edge problem, which is discussed in Chapter 9. They occur whenever one adds or 
removes a local electron which has a Coulomb interaction with the local electrons. This 
theory is discussed by Liu (1988). 


6.3. HUBBARD MODEL 


The Hubbard model has become the standard Hamiltonian for the investigation of 
electron behavior in metals where the correlation effects are quite strong. Many analytical and 
numerical techniques have been applied to this Hamiltonian. It is routine for young theorists 
to announce that they have “solved the Hubbard model”. In spite of this huge effort, 
extending over many years, there are many features of this Hamiltonian in two and three 
dimensions which are not understood. 

This section discusses the Hubbard Hamiltonian Hy and also the extended Hubbard 
Hamiltonian 7, 


Hy = -t $ C} s oCo + U E nyn (6.124) 
joo j 
nio = C} Co (6.125) 
Hy =Hy +5 Yo nonw VR — R) (6.126) 
j#l,oo' 
e? 
VR) S y (6.127) 


The summation j is over lattice sites. The summation 6 is over the neighbors of j. The 
summations over ©, o’ are over spin indices. The term ¢ is a hopping overlap energy between 
first neighbors. It is assumed to be the same for all neighbors. This choice implies s-wave 
symmetry: the orbitals are the same in all directions. Each site has the same symmetry. It is 
possible to take a model with more complications, but the present one is hard enough. 

The extended Hubbard Hamiltonian includes the Coulomb interaction between electrons 
at distance sites. This term is essential to obtain the correct charge fluctuations. The pure 
Hubbard Hamiltonian Hy does a good job describing spin fluctuations, but does a poor job on 
charge fluctuations. 

The Hubbard model has several energy scales. The first 1s the on-site Coulomb inter- 
action U, and the second is the bandwidth W = 2zt, where z is the number of nearest 
neighbors: it is the number of terms in the summation over 6. The behavior is very different 
depending upon the ratio W /U. If this ratio is large, then the value of U is small compared to 
the bandwidth. Here the free-electron model of metals works well. The case of simple metals 
such as aluminum or sodium have W > U. The other limit is when U > W. Here electron 
correlations are dominent. This case is only poorly understood. It is known that if the band is 
exactly half-full, with one electron per site, then the system will be antiferromagnetic when 
U > W. There have been many attempts, beginning with Penn (1966), to derive a phase 
diagram at zero temperature to predict all of the different order parameters at different 
concentrations n and different values of U/W. Phase diagrams are discussed in the next 
section. 
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6.3.1. Spin and Charge Separation 


Both the Hubbard model, and the extended Hubbard model, can be written in a form 
which demonstrates the separation of charge and spin degrees of freedom. The extended 
Hubbard model is discussed here. The Hubbard model is obtain, at any point in the derivation, 
by merely setting to zero the interaction V(R). 

The first step is to change the Hamiltonian to wave vector space. 


1 , 
Cig = wwe e™ BC (6.128) 
—t 3 Chis Cio = 2 e(k)Cİ Cko (6.129) 
joo o 
e(k) = —t $ e™ ° (6.130) 
Ò 


The term in e(k) is the kinetic energy. The other terms are the various Coulomb interactions. 
The Fourier transform of the extended Coulomb interaction has an interesting form 


v(q) = X} et Y(R) (6.131) 
JF#O 
0= > vq) (6.132) 
q 


The term with j Æ 0 is omitted from the summation since it is the Hubbard interaction U. The 
term U is included separately from v(q). The second formula, in the above set, shows that the 
summation of v(q) over all values of q gives zero. This happens because the summation of 
v(q) over all q gives V(R = 0) and that term is omitted from the definition of v(q). 

The actual summation must be done with Ewald methods. At small values of q, 
v(q) —> 4ne*/(g7V>) where Vo is the volume of the unit cell. The potential v(q) behaves as a 
normal Coulomb interaction at small values of g. However, since the summation of v(q) over 
the crystal Brillouin zone of q values must give zero, then every region of positive value must 
be cancelled by another region where v(q) is negative. Generally the negative regions of v(q) 
occur at the points in the Brillouin zone with larger values of q. Negative values occur from 
the phase factor exp(iq : R,) in Eq. (6.131). It can be negative when the exponent is near to n 
radians. 

The Coulomb interactions are written in terms of the density operators of the electrons. It 
is convenient to introduce the electron density operator p,(q) of a single spin, where 
o = +, |. The charge density operator p.(q) is the sum of both spin components, while the 
spin density operator p,(q) is their difference 


Pola) = X ChigoCko (6.133) 


P.(q) = P,(q) + p, (q) (6.134) 
p.(q = P,(q) — Py (q) (6.135) 
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The terms in the Hubbard interaction and in the extended Coulomb interaction can be written 
in terms of these operators. 


U Limam = 2 p;(a)p,(—4) (6.136) 
= ay y DLPe(@PA—@ — Ps(@)p.(~a)] (6.137) 
LS nomo V (Ry) = 55 Dpp- (6.138) 
? flee’ q 


The two interactions can be combined into a charge and a spin interaction 


V, = 5 Diol) +4 UPDP) (6.139) 
q 


U 


It is interesting that the effective interaction for charge fluctuations is v.(q) = v(q) + U/2. 
This quantity can also be negative at large values of q. There is a simple proof due to Chen 
and Mattis (1991) that v(q) + U must always be positive. Assume that the charge distribution 
on an atomic site is (r), then 


vr(q) = U + v(q) (6.141) 
e? 


vr(q) = | Prr AOE FR] “ (6.142) 


The summation over j includes R, = 0 since that is U. The summation can be written as a 


periodic potential and therefore expanded in reciprocal lattice vectors G 


e a: 4B, Ane? G-a 
oP ERIS “= V Ge” oe (6.143) 
j j G holqa + GI 
4re -~ 
vr(@) = > ———-5 |o(q + G) (6.144) 
i Tat GP 


$a) = l Prè olr) 


The right-hand side of Eq. (6.144) is a sum of positive terms, so it is strictly positive. 
Although v(q)+ U is strictly positive, the effective Coulomb interaction v.(q) = vr — 
U/2 = v(q) + U/2 may be negative at large q. 
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The next step is to evaluate the correlation function of these operators. Assume the 
system is nonmagnetic, so the average spin-up and spin-down correlations are the same. First 
assume V, = V, = 0 then 


P,a, id,) =+ Lf ae (T,p.(a, 2)p!(a, 0)) (6.146) 
1P, 
-Ż | dre" (T ip (q, DP} (a, 0) + pCa; DPI, 0) (6.147) 
= Pq, iq,) (6.148) 
1p 
P5(Q; iqa) = -> | dre" (T.p,(q, DP} (a, 0)) (6.149) 
ip 
==]. deel (Tlp la DPA 0) + p,a TP} (a, ON) (6.150) 
= Pq, ig,) (6.151) 
1p 
(Qs) = —— | dret tT pela Pia 9) (6.152) 
1 £8 
=~) are" (Tele (4, DP (4, 0) — pa Dea ON) (6.153) 
— 0 (6.154) 


where P‘)(q, ig,) is the usual polarization function for the electron gas, which was discussed 
extensively in Chapter 5. Terms such as (p,p,) are set to zero since different spins do not 
correlate if there is no interaction. 

These simple results are the basis for the separation of the spin and charge degrees of 
freedom. This separation is exact in the Random Phase Approximation (RPA), but is only 
approximate when evaluated with additional terms such as vertex corrections. The RPA is a 
series of bubble diagrams obtained by evaluating (ppt) = PO). In a nonmagnetic system 
(Pe pi) = = 0 and the spin and charge interactions do not mix. 

For example, now include the interactions V,, and evaluate the correlations in Eqns. 
(6.146) and (6.149) using the RPA. In both cases the result is obtained by summing the bubble 
diagrams 


PO (q, ig,) 
1 — v,(q)P(q, ig,) 


Pq, ig,) 
U 


P.(q, iq) = (6.155) 


P5(q, iq) = (6.156) 


The denominators of these two expressions are very different. One gives the modes associated 
with charge fluctuations, while the other the modes from spin fluctuations. In Chapter 5 these 
types of expressions were evaluated for electrons in energy bands with parabolic dispersion. 
Let iq, — © + in where n is infinitesimal. The limits depended on the dimensionless ratio of 
x = @/(qvr). For large x one had plasmon modes, while for small x one had static screening. 
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In the present case, evaluate these expressions for energy bands described by a tight- 
binding model, such as Eq. (6.130). For tight-binding bands define an average velocity 
(v) = (V,€(k)). Again define x = w/(q(v)). The polarization function is 


2 Ny — M+ 
PY(g, o) == > ———"k "era 6.157 
(a, ©) N X &(k) — sk +q) + o + in ( ) 


where the factor of two is for spin summation. Note that the normalization here is slightly 
different than in Chapter 5: the summation is divided by N rather than by v = VaN where Vo 
is the volume of a unit cell in real space. 

x = 0: First examine the case that x = 0 which is achieved by setting œ = 0. Then the 
polarization function is static. Next, consider the limit that q — 0. For tight-binding bands 
take e(k + q) > e(k) + v; + q. The latter expression is also used in the occupation function 
N+q = nr (elk) + vz * q)) which is expanded for small q 


ayy, ay. 2 ondeto] — Ire) + vy + adn /de(l) 
tim PG) = a(k) — [e(k) + vy q] 


= —2N, (6.158) 
1o (_ dnek) 
N; = > Fe ) (6.159) 


The factors of v; *q cancel in numerator and denominator, and the final result depends only 
on the energy derivative of the occupation function. This derivative is a delta function at zero 
temperature. The density of states for a single spin state is defined as Np. Using this result in 
the two susceptibilities (6.155) and (6.156) gives as q > 0 


—2N; —2Nž 
e(a: 0) 1+N-[2v(q)+U] 1+k?/g? (6.160) 
—2Np 
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The Hubbard interaction causes a renormalization of the density of states, for charge fluc- 
tuations, to N*#. The screening at small wave vector has the Thomas—Fermi form, with a 
screening wave vector k, define in Eq. (6.163). 

The spin susceptibility P, has only constants in this limit of x —> 0. Its denominator is 
interesting since it is possible that 1 — UN; < 0, which is the Stoner Criteria for the onset of 
a magnetic phase. The phase could either be ferromagnetism or else antiferromagnetism. 
These possibilities are discussed in the following section. The Stoner criteria assumes that the 
polarization function P“)(q, 0) has its largest negative value at the center of the Brillouin zone 
q = 0. However, antiferromagnetism occurs when 1 < UP“)(q, 0) for nonzero values of wave 
vector q. Other interesting choices might have q at the edge or corner of the Brillouin zone. In 
this section, it is assumed that the Stoner criteria is not satisfied, and there are no collective 
magnetic states. Our interest is in the spin and charge fluctuations for the paramagnetic state. 
Magnetic states are considered in the next section. 
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Large x: Now consider the other limit where œ >> q(v). Start with Eq. (6.157) and 
rewrite it by changing variables k — —q — k’ in the term with nų +q» Which gives 


a) _2 Fy 
PING: ©) = 5 Le nre) Es ekt +0" ek) ekt l 


Next expand the denominators in a series in inverse powers of the frequency. The expansion 
parameter is [e(k) — e(k + q)]/œ. The first nonzero term is 


lim POG, 0) > -4 D nre) — elk + q) (6.164) 
0—0 oN E 


The part in square brackets is expanded in powers of q. The first nonzero term is proportional 
to q’. In the first-neighbor tight-binding model it is 


2,2 
2 S nple(k)fe(k) — ek +0] = ZE nek) a: Se =2 4 Ey (6.165) 
N* NÈ 5 d 
(Ex) = 5 Dnpleltde) (6.166) 
Dia oy CD (Fx) 
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The summation over k removes the angular dependence from exp(ik-6) and then the 
summation over 5 averages (q°5)° to (q8) /d. The symbol d is the dimension. A similar 
result is obtained for the free electron gas. Note that the average kinetic energy (Ep) is a 
negative number, so (—E;) is positive. This result for P® in three dimensions is used in 
Eqns. (6.155) and (6.156) to give 
l (q8) (—Ex) 
P.(q, ©) = 302 (+ bg) (6.168) 
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The correlation functions P, , have denominators which depend upon frequency. Zeros of the 
denominators represent excitations in the system of particles. The charge fluctuations have 
their normal behavior of a plasma excitation œ, at long wavelength. The Hubbard interaction 
gives the plasmons a quadratic dependence upon wave vector. Note that in the pure Hubbard 
model, where the long range Coulomb interactions are set to zero, then the excitations in the 
charge fluctuations become acoustic œ = cyq. The Hubbard model gets the charge fluctua- 
tions wrong, in that it predicts linear waves at long wavelength. The extended Hubbard model, 
which includes the long range of the Coulomb interaction, gets the correct behavior of a 
plasma frequency at long wavelength. 

The spin fluctuations give damped modes œ = +iqcy. They have very different behavior 
than the charge fluctuations. These results were derived only using the RPA. However, the 
general trend of the results is thought to be valid for values of U < W. 
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6.3.2. Exchange Graphs 


The Hubbard Hamiltonian contains the interaction U multiplied by the combination of 
n,,n;, for every site j. The argument in favor of this form of the interaction is that the 
exclusion principle forbids two electrons of the same spin from occupying the same orbital on 
the same site. Only electrons of the opposite spin can occupy the same orbital, and have the 
interaction U. This feature is built into the Hamiltonian, which assumes the orbital is an s- 
state which has only spin degeneracy. 

There is another way to treat the problem. Instead of building the symmetry into the 
Hamiltonian, one builds it into the wave function. In this formulation, the Hubbard and 
extended Hubbard Hamiltonians are 


Hy = t$ Cis Cio HU E nyt (6.172) 
jbo joo’ 
nio = C} Co (6.173) 
l 
Hg = Hy -+ 7 > Nighio' V(R; — R,) (6.174) 
JAl,oo’ 


Now the interactions are between all electrons, regardless of the spin. The feature that no two 
electrons, with parallel spin, can occupy the same site must be built into the wave function. In 
our case, it must result from summing the correct sets of diagrams. Which set of diagrams 
ensure the correct symmetry? 

First convert the Hamiltonian to wave vector representation. For the extended Hubbard 
model 

Ha = LAWL Cus +z E orp (6.175) 
ko q 
Note the Coulomb interaction contains only the charge fluctuations, with the full interaction 
vr(q). Writing this interaction as pp allows a particle to interact with itself. This contribution 
is excluded. Starting from this interaction, the separation of charge and spin is less obvious. 
Equation (6.175) is called the “full Hubbard model”. 

The various terms in perturbation theory are examined to find the cancellation which 
ensures that only electrons of opposite spin occupy and interact on the same lattice site. 
Generally, the cancellation is done by “exchange graphs”. Repeat the discussion in Sec. 5.1. 
and examine a few terms in the perturbation expansion. In the first order of perturbation 
theory there are two terms: Hartree and exchange. In the discussion of the jellium model the 
Hartree term gave no contribution: it utilizes the q = 0 term in the interaction v;(0), which is 
absent in jellium. However, here there is a q = 0 term which is U. 


Lic = U(ng + nz) (6.176) 
1 
Zo(P) = -Uns — =, 2 V(G)Ny+q (6.177) 
1 
LG t Èo = Ung — N > 0(Q)"p 49,0 (6.178) 
q 


where o is the spin of the electron, while o signifies the opposite spin. The symbols n, 
denote the average concentration. A paramagnetic system is assumed, which has nz = ng. 
The total first-order self-energy, including Hartree and exchange, is Un; for the Hubbard 
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model, which is precisely what one gets from the usual Hubbard Hamiltonian with the 
interaction U }°, n;oħ;a. The extended Hubbard model also has an exchange term of the usual 
form for charge fluctuations. The same cancellation occurs in the second order of perturbation 
theory. The RPA part of the self-energy is the diagram in Fig. 6.9(b), while Fig. 6.9(a) shows 
Hartree and exchange. Figure 6.9(b) is a closed loop connected to the Green’s function by two 
interactions. Let P,, denote the closed loop polarization diagram for electrons of spin o. In the 
usual Hubbard model, the closed loop must have opposite spin of the electron, so it gives a 
self-energy term with an effective interaction of U*P;. In the usual extended Hubbard model, 
the interaction v(q) can interact with a closed loop of either spin. The self-energy in second 
order, from the usual extended Hubbard model, is 


ZDP) = LY Gp + PDA? + 2Uo(q) + U7 (6.179) 
q 


The term with 2Uv(q) comes from Fig. 6.9(b) where the two interaction lines have U on 
one side and v(q) on the other: the factor of two since there are two ways of doing this. For the 
full Hubbard model in (6.175), with all spin interactions, the same diagram gives 


Pp) = ÈI Gp + a)Po(a)L2v(q)" + 4Uo(q) + 207] (6.180) 
q 


which is different. Including the exchange diagram in Fig. 6.9(c) makes these two self- 
energies more alike. 


Dp) = -5E orl@r(a)90 +IP +IP +444) 
99 
= -ŻE Ip + gPalqlU* + 2U0(q)] 
q 


l 
-7a Vala) + NAP + IR +a +a’) 
qq 
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Adding the two self-energy terms gives 


2P) = FE IP + DP? + 2U%q) + 0?) 


— a D (g(a) 4p + )9@ + q)9R+4q+q) (6.182) 
qq 


The first term is identical to (6.179). The exchange graph Fig. 6.9(c) cancels out the second- 
order parallel spin terms in the RPA term (Fig. 6.9(b)) for the Hubbard model. Note that the 
exchange graphs also produce exchange contributions from the charge fluctuations. 

The RPA expansion for the extended Hubbard model treats the interaction U differently 
from v(q). The exchange graphs must be included in order to cancel out the parallel spin parts 
of the U interation. When doing RPA for the charge fluctuation interaction v(q) the exchange 
contributions are usually omitted. Starting from (6.175) demonstrates this difference. This 
different treatment may not cause problems. The exchange graphs from charge fluctuations 
give large contributions to the energy of the electrons, but contribute little to the dynamics: 
collective modes, etc. 


6.4. HUBBARD MODEL: MAGNETIC PHASES 


The Hubbard model has various phases with magnetic ordering. The usual metallic 
phase, with no magnetic ordering, is called paramagnetic and is abbreviated “P” on phase 
diagrams. It has equal number of electrons with spin up and down in the various bands, and 
the spins have no ordering except local correlation. They have short-range but not long-range 
order among the spin alignments. Other magnetic phases which are considered here are: (1) 
ferromagnetic, where there is a net magnetic moment in some direction, which is caused by 
having a majority of conduction electrons with spins pointing in the same direction. It is 
denoted as “F” in phase diagrams (ii) antiferromagnetic ordering where there are an equal 
number of conduction electrons with spins up and with spin down. The symbol “A” is used 
for this phase. The distinction between PM and AF is that in AF the spins alignments have 
both long-range order and short-range order. These three phases will be discussed in great 
detail below. 

Other magnetic orderings are possible. A charge-density wave is denoted CDW. Its 
signature is that the average value of the charge density operator 


(PD) = X (CktgoCko) (6.183) 
fe} 
has a value of O(N) for some nonzero value of q. CDWs usually occur when the vector q 
connects two regions of the Fermi surface, which is called nesting. Similarly, one form of 
spin-density waves (SDW) is given by 


(Ps) = (Chaar Cur) — (Chg Ci) (6.184) 


The existence of CDW and SDW was first suggested by Overhauser (1964, 1978), and they 
have been found in many materials. They are not discussed here. Some work has suggested 
that the Hubbard model has regions of its phase space where there exist CDW and SDW. 
The ground state energy of the Hubbard model will be solved for the various magnetic 
phases. Depending upon the variables, each of the three phases may have the lowest energy, 
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and be the ground state. What are the important variables? There are two energy scales: the 
Hubbard U and the bandwidth W = 2zt of the kinetic energy, where z is the coordination 
number. Usually the results are plotted according to the ratio of these two variables, such as 
the combination U/W. The inverse ratio is used sometimes. Another important variable is the 
concentration n of conduction electrons on the lattice. In the usual Hubbard model, the band 
is full when n = 2. This case is uninteresting since none of the electrons can move and the 
system is an insulator. The lattice has an average of one electron per site when n = 1. This 
case is important since the AF phase occurs when n œ~ 1. The temperature is an important 
variable, since the magnetic phases disappear at high temperature. In two dimensions, they 
disappear at any nonzero temperature, since the Mermin—Wagner theorem (1966) states there 
is never long-range order in 2D. 

The Hubbard model has particle hole symmetry, which means that the phase diagram is 
symmetric. It has reflection symmetry around the point n = 1. Any phase that exists for 
0 <n <1 also exists for n —> 2 — n. This simple theorem is easy to prove. The reference 
state for electron calculations is the quasiparticle vacuum |0),. We define another reference 
state |FB} which is the state if the electronic band is completely full of electrons: the state with 
n = 2. The symbol “FB” denotes “full band”. It is easy to calculate the energy, of this state in 
the real space representation. Since all states are full, the kinetic energy term } C!C IFB) = 0. 
The state on j + ô is already occupied and the creation operator gives zero. In real space, 
all site occupancies n,, are one, so the Hubbard interaction is UN. The result is 
H|FB) = NU|FB). 

A hole is defined as an electron missing from the state |FB}. Let |0), denote the “hole 
vacuum” and the hole operators are d, dt. Then 


C;olFB) = di; 0), (6.185) 
CjolFB) = djs 0), (6.186) 


Destruction of an electron is equivalent to the creation of a hole. The spin changes sign 
between the electron and hole operator. The destruction of an electron of spin up leaves the 
band with net spin down, which is assigned to the hole. Using the commutator for Fermions, 
the number operator for electrons is changed to a number operator for holes 


T 
C} C;olFB) = djs dalon (6.187) 
= [1 —di,dj5]l0), (6.188) 
H, = H, + UN — n) (6.189) 
H, = p I 5.640 t Uy none (6.190) 
h 
ne =a, (6.191) 


The hole Hamiltonian H, has nearly the same form as the original electron Hamiltonian H,. 
The difference is that the hopping term ¢ has changed sign. This inverts the bands, which is no 
Oe In the square lattice we can select as the center of the Brillouin zone the wave vector 

= (n, T)/a. Since cos(0, + Q,) = —cos(0,), this choice of band center inverts the bands 
beck to their original form. An alternate, but equivalent procedure, is to define the hole 
operators with a minus sign (C;,|F'B) = -di = |0),) on one of the two sublattices, which then 
retains the negative sign in front of the hopping term. The hole problem is identical to the 
electron problem. Any electron state of n has the same hole state of 2 — n. This proves 
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particle-hole symmetry. The constant terms in the transformation, NU(1 — n), are phase 
independent so do not influence the selection of the magnetic ordering. This discussion 
completes the digression into the explanation of particle-hole symmetry. 

Another important variable for magnetic ordering is to choose the lattice for the 
calculation. The results depend upon the choice of the lattice, and also depend upon whether 
the calculation is done in two or three dimensions. The early work on the Hubbard model 
concentrated on three dimensions, and most calculations were done for the simple-cubic (sc) 
or the body-centered-cubic (bcc) structures. The discovery of high temperature super- 
conductivity in the copper oxides, called “cuprates”, heightened interested in layered 
compounds. This focussed attention on the two dimensional lattice. Most recent work has 
been done on the square lattice (sq) in two dimensions. This lattice has the advantage that the 
Brillouin Zone is a plane and is represented nicely on a sheet of paper. The sq lattice has 
coordination number z = 4 and W = 8t. The hopping energy, from the interaction with 
nearest neighbors, is 


e(k) = —2¢[cos(0,) + cos(9,)] (6.192) 
0, = ka (6.193) 


where a is the lattice constant. 


6.4.1. Ferromagnetism 


In thinking about the ferromagnetic state, it is useful to keep in mind two theorems. The 
earliest was by Kanamori (1963), who proved that in the limit of large U, a system which is 
doped slightly away from half filling is always ferromagnetic. The second theorem, by Vilk et 
al. (1994), is that the the Hubbard model on the sq lattice is never ferromagnetic. These two 
theorems directly disagree, and show that the Hubbard model is still not completely under- 
stood. 

The ferromagnetic state has a majority of spins pointing along one axis. This axis can be 
in any direction, since the theory is isotropic. Take it to be spin up. Denote the concentrations 
of up spin as n, and down spins as n_. Use the symbol n, where o = +. Also, ng is the 
density of spins opposite to o: the overbar denotes a negative value. The net concentration is 
n =n, +n_. Ferromagnetism occurs whenever n, Æ n_. 

Denote by m the fractional magnetization. 


m= +t (6.194) 
ny +n_ 


Values of the magnetization range —1 < m < 1. The spin densities may be expressed as 


n = 50 +m) (6.195) 


n_ = (1 — m) (6.196) 
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The Hubbard model was solved earlier using RPA. The first-order self energy was from the 
Hartree interaction, and gives that E(k) = e(k) + Ung. In this approximation the density of 
up spins is 


l 
Ns =a 2 nko (6.197) 
d*0 l 
= lop Belt Un F 1 (6.198) 


The © integrals range from —n < 0, < n for the sq lattice. 

Let p(£) be the density of states. Equation (6.194) can be used to derive a self-consistent 
equation for the magnetization at zero temperature. Then the occupation number becomes a 
step function n-(e — u*) > O(u* — £) 


p(e) = [Ese - ek) (6.199) 
m == | dep@NO(u* — e + mnU /2) — O(u* — £ — mnU/2) 

u*=u— k U (6.200) 

n= COGN — £ + mnU/2) + @(u* — £ — mU /2)] (6.201) 


The renormalized chemical potential .* is determined by the concentration n as given in Eq. 
(6.201). 

The phase boundary is the point where the magnetization begins: it is the line where 
there is a solution for the above equation with m = 0+. Expand the right-hand side of the 
equation for m in a power series in the small parameter m. This side is proportional to m in 
this limit. Cancelling the factor of m from both sides of the equation gives the equation which 
determines the onset of ferromagnetism at zero temperature 


1 = Up(u*) (6.202) 


Equation (6.202) is the first one to predict a phase boundary. There are several caveats. The 
first is that the equation is very approximate. Only Hartree theory is used in its derivation, 
which is neither sophisticated nor accurate. Secondly, this same result earlier was derived 
earlier: it is just the Stoner criteria found in the prior section using RPA. The Stoner criteria 
was given in terms of the polarization operator P“)(q, œ) in the case that œ = 0 while q > 0 


U 
lim lı +> POG, 0) = 0 (6.203) 
q-> 


At zero temperature, in the indicated limit the polarization diagram is exactly equal to 
—2p(u*). Then the Stoner criteria is identical to Eq. (6.202). It is interesting that the simple 
theory is identical to RPA. 
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The next step is to evaluate the density of states, which is given in terms of an elliptic 
function. First, define a dimensionless energy x = ¢/(4f) and dimensionless state density p(x) 


p(€) = 5 [ dO, i d8,d[e + 2t(cos(0,) + cos(0,))] (6.204) 
1. 

px) = ra d0, l d0,ò[2x + cos(9,) + cos(9,)] (6.206) 

p(x) = ZK 1 — x?) (6.207) 


The elliptic function K(z) diverges when z = 1, which in our case is when x = 0. This point 
turns out to be half-filling: when n = 1. The resulting density of states is shown in Fig. 6.10. 
Note that the dimensionless energy spans the range of —1 < x < 1. The band is half-full at 
x = 0 where the density of states diverges to infinity at the van Hove singularity. The 
divergence is logarthmic and the density of states is integrable through this divergence. In 
fact, the total density of states integrates to unity: 


1 
| = l dxp(x) (6.208) 
-1 


which makes sense since p(s) is the density of states for a single spin configuration. The other 
energy variables are also normalized to 4t: u = U/4t, v = p/4t. 

It may be possible to have a ferromagnetic state with partial magnetization 0 < |m| < 1. 
The onset of the FM state is defined as m 4 0. However, all the examples we have solved have 
the feature that at the phase boundary m jumps to unity. Therefore, that will be assumed here, 
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FIGURE 6.10 Density of states for an electron on a square lattice. 
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in order to simplify the algebra. In the PM state m = 0 and in the FM state |m| = 1. In this 
case ifn, =n and ng = 0, the chemical potential for the majority spins is just v* = v. The 
feature that n = 0 requires that v — un < —1, which can be shown from the definition 


l 
ng = | dxpx)O(v — un — x) (6.209) 
—] 


The integral is zero if the argument of the theta function is never positive. 
The Stoner criteria for the phase boundary between FM and PM is found by solving 
together the two equations 


1 
n= l dxp(x)O(v — x) (6.210) 
-1 


Ue) l 
nee (a) KO ea) 


The first equation (6.210) is solved to give n(v), where v is the dimensionless chemical 
potential which ranges —1 < v < 1. The second equation (6.211) is solved to give the phase 
boundary (U./4t) also as a function of v. This result is combined with the first calculation to 
give the phase boundary (U./4t) as a function of n. The predicted phase boundary reaches 
U, = 0 at n = 1, which occurs because of the divergence in the density of states for the sq 
lattice at half filling. Most other lattices do not have this divergence, and at n = 1 ferro- 
magnetism only occurs above a critical value of the Hubbard parameter U.. The sq lattice is 
unusual in having this phase boundary at U, = 0 at n = 1. All of this discussion is actually 
pointless, since at n = 1 the system prefers to be in the antiferromagnetic state for small 
values of U, rather than in the ferromagnetic state. 


6.4.2. Antiferromagnetism 


Figure 6.11(a) shows a lattice of circles and squares. If all of the points were alike, this 
structure is the square lattice. The area of the unit cell is one square. In the AF state, the 
circles form one sublattice, while the squares form another. Antiferromagnetism occurs when 
the circle sublattice has all electrons with spin up, while the square sublattice has all electrons 
with spin down. Or vice versa. In real space, the unit cell in the antiferromagnetic state is the 
square formed by four nearest neighbor circles. This cell has exactly twice the area of the 
original unit cell. The occurrence of antiferromagnetism, in this case, doubles the size of the 
unit cell. If Ag is the area of the cell in real space, and A,z is the area of the Brillouin zone, 
then the theorem of areas states that (21)? = A Azz. When one doubles the area in real space, 
the area of the Brillouin zone reduces in half. 

Figure 6.11(b) shows the Brillouin zone for the sq lattice. The units assume a bond 
length of a = 1 so the minimal reciprocal lattice vector is G = 2n(+1, 0) or 27(0, £1). The 
Brillouin zone for the sq lattice is the large square whose edges are solid lines. In the AF state, 
the Brillouin zone is half as large. Then its boundaries are the dashed lines in the shape of a 
diamond. The dashed line is also the Fermi surface in the paramagnetic state for half-filling 
(n = 1). This line has an energy e(k) = 0. Also shown in the figure is the vector Q = (n, n) 
which spans the Fermi surface at half-filling. Another similar wave vector Q’ = (n, —7) is not 
shown, but it spans the Fermi surface in the direction perpendicular to Q. The vector Q + Q’ 
equals a reciprocal lattice vector, so that Q and Q’ are regarded as equivalent. They play a 
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FIGURE 6.11 The square lattice. (a) In AF phase, circles have electrons of spin up, squares have electrons of spin 
down. (b) Brillouin zone for sq lattice. 


special role in the theory of antiferromagnetism. From now on the symbol Q is used to denote 
either of the four vectors (+7, +7). 

Let k denote any wave vector in the Brillouin zone for the AF state. These are wave 
vectors in the diamond shaped area in Fig. 6.11(b). The combination of k + Q takes one to a 
state outside of this reduced zone, but still within the original Brillouin zone of the para- 
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magnetic state. In a reduced zone representation of the AF state, the wave vector points k and 
k + Q are the same point. In the PM state they are different points. So the onset of anti- 
ferromagnetism is governed by order parameters of the form 


l 

A,,(Q) = W Zll Cktoo Cko) — (C$ oCko) (6.212) 
1 

Aga(Q) = F È (Cko Ck) (6.213) 


where N is the number of lattice sites. These order parameters are similar to those for charge 
and spin density waves. The AF state is a special kind of SDW. Either of the above two order 
parameters can be used to describe the correlation in the AF state. The feature that a single 
wave vector Q connects large segments of the Fermi surface is an example of nesting. 

The sq lattice is a typical lattice for the description of antiferromagnetism. It naturally 
divides into two sublattices, which doubles the unit cell in real space and halves the Brillouin 
zone. There are other lattices which have a different behavior. Figure 6.12(a) shows the 
honeycomb (hc) lattice which has two different lattice sites in each unit cell. Each site has 
three nearest neighbors. Antiferromagnetism occurs when one site has electrons with spin o 
while the other site has electrons with spin o. In this case the unit cell does not change size, 
and the Brillouin zone in the AF state is identical to that of the PM state. Then the correlation 


(a) 
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FIGURE 6.12 (a) The hc lattice. (b) The pt lattice. 
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functions for the onset of the AF state have Q = 0. The hc lattice occurs in nature as the 
structure of a single plane of graphite, where the vertices are carbon atoms. In graphite the 
chemical o-bonds are the lines in the plane and are fully occupied. The conduction band is 
composed of p, orbitals which are perpendicular to the plane. They bond isotropically to the 
three first neighbors, and the band structure is given accurately by the first neighbor tight- 
binding model with an overlap parameter t = 2.4 eV. The band is half full (n = 1). There is no 
experimental evidence for an antiferromagnetic state, probably because U is too small. Both 
the sq and he lattices are bipartite, which means they have an obvious way of supporting an 
antiferromagnetic structure with alternate up and down spins. 

Figure 6.12(b) shows the plane triangular lattice (pt). The vertices are lattice sites, while 
the lines show bond directions. Each lattice point has six first neighbors. The pt lattice is not 
bipartite. The basic structure in the unit cell is the equilateral triangle. This figure cannot 
support antiferromagnetism: if one vertex has spin up, and another has spin down, then the 
third has an equal tendency to be up or down, and is frustrated. It is believed that the pt lattice 
does not have an antiferromagnetic structure at zero magnetic field at half-filling. All three of 
these lattices (sq,hc,pt) support a ferromagnetic structure. 

Now consider the antiferromagnetic transition for the square lattice. Start by examining 
the function 


1 


Ago = W Z (Clera Cko) (6.214) 
= T E ino)it R (6.215) 
N 


The first expression is the charge density wave correlation function expressed in wave 
vector space. The second equation is the same expression expressed in real space. For the 
paramagnetic state, each site j has equal probability of having an up or down spin. In this case 
(Nj) = n/2. Then the correlation function equals zero unless q = 0. 

A different result is obtained in the antiferromagnetic state. Then one sublattice has all 
spins o while the other sublattice has all spins o. If one sublattice has spin up, and the other 


has spin down, then 


-2 © eR err] (6.217) 
N every other R, 
+ = (+1, 0), or (0, £1) (6.218) 
The largest value of the above correlation function is when exp(iq - T) = —1 which is when 


q = Q. This is another argument for why the wave vector Q is special. 
Bogoliubov theory is used for the calculation of the AF state. Start with the standard 
Hubbard Hamiltonian 


U 
H=} ek) + Cho Chg Crypt (6.219) 
ko kpq 
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In the Hartree approximation for the paramagnetic state, evaluate the Hubbard energy at 
q = 0 to get two energy terms 


U 
Nee [Co Con) Cie Cry + (Chey Cey) Cop Cpr] (6.220) 
p 
H = > [e(k) + Ung|n,, — UNngng (6.222) 
ko 


The last term in the effective Hamiltonian is a constant -UNn ng which is included to get the 
correct total energy. The Bogoliubov approximation overcounts the interaction, so this term is 
subtracted to get the correct value. For the antiferromagnetic state, include the correlations 
with q = Q. In this case the pairing of the last term in (6.219) is 


U i i i t 
n Cp rat Cpt) Ck-o Cry + (Cy Cr) Cp +91 Cpt 


The brackets are Ag,. The effective Hamiltonian in the Bogoliubov approximation is 


H= > [e(k) + Uns |" + > [e(k + Q) + Uns lnk+Qo 
ko ko 


+U Z Agal C196 Cko + Che Croo] — UN(ngng +Agodgs) (6.223) 
O 


Constant terms are again subtracted from the effective Hamiltonian to get the correct ground 
state energy. The summation over k is now only for the Brillouin zone of the AF state. The 
states k + Q are added in order to sum over all of the states in the PM zone. For the sq lattice 
e(k + Q) = —e(k). This Hamiltonian is easily diagonalized by using the transformation 


Ako = cos(9})Cko + sin(0})Ck+Qo (6.224) 
Bko = — sin(O,)Cy + cos(0})Ck+Qo (6.225) 
Cpo = C08(8,) 2x5 — Sin(,) Bo (6.226) 
Cy Qo = SiN(O;)e 6 + COS(O;,)BKo (6.227) 


The two operators Cko and Ck+Qo are replaced by &ko and Bko- The angle 0, is determined 
below. 

This transformation is inserted into the Hamiltonian (6.223). Some algebra gives the 
expression below. We omit writing the subscripts (ko) on each term. 


H = Vifaa + Bt B]Uns + [ata — B? Ble cos(20,) + UAggs sin(20;)] 
ko 


+ (aB + B'a)[UAg, cos(28,) — £ sin(28,)]} (6.228) 


Sec. 6.4 e Hubbard Model: Magnetic Phases 421 


Since « and 6 are to be pure eigenstates, the last term in the above Hamiltonian must vanish. It 
vanishes by selecting the angle 0, to be 


UAgs 
tan(28,) = A (6.229) 


E(k) = ,/e(k)’ + (UAgs)” (6.230) 


H= X {ako %kolUna + E(k)] + Bio BkolUns — EK)]} 
— UN (nnz + Ago4Qgs) (6.23 1) 


The energy spectrum has developed a gap which is twice UAgs. The dimensionless gap is 
A = uAgg. The energy E(k) is similar to that found in the BCS theory of superconductivity in 
Chapter 10. There are some important differences. The main one is that the chemical potential 


u does not go into the expression for E(k): note it is not written as ,/(¢ — u)? + (UAgs) - 


This difference is significant since the energy gap is only important if there are regions of 
occupied electron states where e(k) = 0. In the sq lattice, there are such energy states along 
the dashed-diamond shaped curve in Fig. 6.12(b). However, if the density of electrons n is 
changed away from n = 1 then these zero-energy states may not be occupied. The fact they 
could have a gap becomes less important. Then the tendency for the AF state diminishes. The 
lack of the chemical potential in E(k) is significant, and makes the theory much different from 
the BCS theory of superconductivity. 

As in BCS theory, in Chapter 10, the final step in the calculation is to obtain a self- 
consistent equation for the order parameter Ag,. Again restricting the summations over wave 
vector to the Brillouin zone in the AF state, and using the transformation to the states (a, B) 


Ago = T EL Chgo Cko) + (Cio CktQo)] (6.232) 
= — Y sin(20))le. cays — Blo Buo] (6.233) 

Nk 
Aaa = Aas y È ggg r EW) — HY) — p(B 0k) ~ p”) (6.234) 


The B state has the negative energy with respect to the Fermi surface, and it is the state which 
is most occupied at low temperatures. The expression in brackets in the last equation, which is 
the difference in occupation numbers for the alpha and beta states, is negative. The solution to 
this equation requires that 4g, = —Agg. This choice is in agreement with Eq. (6.212). The 


maximum order parameter occurs when Ag, = —Agg. The self-consistent equation for the 
energy gap in the AF state is 
U 
l= — y*) — — * 
BO a Ine(—E(k) — p*) — np(E(k) — 2] (6.235) 
U l inh(BE(k 
sinh(BE(k)) (6.236) 


~ N X E(k) cosh(Bu*) + cosh(BE(k)) 


At zero temperature the opression is written as 


-7 San a [O(E(k) + u*) — O(u* — E(k))] (6.237) 
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Again let the density of states in the sq lattice be p(e). Recast this equation as 


© E © sihGE 
I= uj 4° Ele) cosh(Bu*) + cosh(BE) (6.238) 


E(e) = ,/ £ + (Uoo) (6.239) 


Note that the upper limit of integration is now zero, since the summation over k only 
extended over the reduced Brillouin zone. The effective chemical potential u* = u — Ung is 
determined by requiring that the number of particles remain n 


ko 

= D [hots + BhoProl (6.241) 
ko 

E 0 cosh(BE) 

= | #0 SSG) eos) (8-242) 


If u* = 0 then the right-hand side integrates to one, as it should. This identity is valid 
regardless of the value of the energy gap. For values of n away from n = 1 one has to solve 
this equation in order to obtain n(u*), which therefore gives u*(n). 

The gap equation is (6.238). It is identical to the Stoner criteria. Recall that the Stoner 
criteria for the AF state is given in terms of the polarization function P,(Q, 0) for a single 
spin component 


1 = —UP,(Q, 0) (6.243) 
ol ny — "ko 
P,(Q, 0) = W Lekt ak +0) (6.244) 
_2 ny 
“NY Ew) —KF® (8:249) 
The formula simplifies when using the feature that e(k + Q) = —e(k). 
oiy 
P,(Q, 0) = N E(k) (6.246) 
4t 
= l de E nple — u*) (6.247) 
—4t 
0 . 
_ p(e) sinh(Be) 
To |. de cosh(Bu*) + cosh(Be) (6.248) 


Here the summations over k extend over all of the Brillouin zone of the PM state. The Stoner 
criteria is valid only at the transition temperature where the gap is zero. In this case the Stoner 
theory is identical to (6.238) derived using the Bogoliubov approximation. 


6.4.3. An Example 


It is useful to give an example of these results. A simple model is best, in which all 
results are analytical. To this end assume that the density of states p(€) is a simple constant 
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over the range of energy —W <€< W and is zero elsewhere. The normalization which 
integrates to one electron is 


1 
— -W<e<W 
ple) = į 2W ses (6.249) 


0 |s)>Ww 


It is convenient to use dimensionless units such as x =&/W,v= u/W, and u = U/W. 
Assume zero temperature so that Fermi functions become step functions. 

In the paramagnetic state, then ng = ns =n/2. Define p* = p — Un/2 and v* = 
v — un/2. The density of particles is 


* 


u 
n= 2| dep(e) = 1 + v* (6.250) 
-W 


The factor of two is for spin degeneracy. Define the reduced ground state energy per electron 
as &; = Eg/(NW) where the symbol j denotes P,F, or A for our three possible states. For the 
PM state 


2 
Ec = DEG + ee — uN (=) (6.251) 
2 pe U 2 
Ep = a de| e + US| P(e) - 5 (5) (6.252) 
n un? 
un? 


In Eq. (6.251) the last term is subtracted since the Hubbard term is taken twice in the kinetic 
energy part, where it is overcounted. In the last expression (6.254), v* = (n — 1) is used to 
obtain an expression for the reduced ground state energy as a function of occupation number 
n and the Hubbard parameter (u = U/W). 

The assumption in the ferromagnetic state is that the up (o) and down (6) spins have 
unequal occupation. Assume that up is dominent n, > ng. Again define the magnetization as 
m = (n —nz)/n. An interesting result is obtained by separately calculating these two 
occupation numbers by integrating over the density of states 


u—Uns 
ns = l dep(s) = + [u + 1 — uns] (6.255) 
-W 


u—Uns 
ng = l dep(e) = + [u +1-—-—un,| (6.256) 
-W 


Subtract these two equations, and the difference in the occupations is the magnetization, 
which gives the interesting result 


u 
nm = nm; (6.257) 


This equation has two solutions for n Æ 0: either m = 0 or else u = 2. The latter is the Stoner 
criteria: 1 = Up(0) = U/(2W). What these equations actually mean is that there is no 
solution for m Æ 0 until u > 2. At this point then m jumps to one. The onset of the FM state is 
given by the inequality U > 2W. The calculation of the ground state energy is rather easy. 
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Since n = 0 there is no Hubbard energy: all the spins are alike to avoid any Hubbard energy. 
The ground state occupation and energy for the FM state are 


Ng =N= (1 +v) (6.258) 
Eg = n| deep(s) (6.259) 
6p =4(Vv -D=nn-1) (6.260) 


where the expression for v(n) is used to obtain the last expression. The corrrectness of this 
result can be confirmed by comparing the ground state energies for the FM and PM states. 
The FM state has the lowest energy whenever p < &p which gives 


| un? 
n(n — 1) < n(n — 2) + PE (6.261) 


which has the solution that 2 < u for all values of occupation n. The phase diagram for a 
system with just two ordering possibilities, PM or FM, is quite simple. The boundary is a 
horizontal line at U/W = 2. Above the line is the FM phase and below it is the PM phase. 
The line is flat because a constant density of states is assumed. The curve is actually a plot of 
U(n) = 1/p[p(n)]. If the density of states is not constant, then the phase boundary has 
curvature. 

The third phase is antiferromagnetic (AF). First evaluate the Stoner criteria for the phase 
boundary between the AF and the PM states 


| 
l= -u | TE (e) = — É Iniv| (6.262) 
-w € 2 
U(n) = -— (6.263) 
Inja — 1| 


This expression has three singular points. At n = 0, 2 then In|jm — 1| = In(1) = 0 and the 
value for U, diverges to positive infinity. The third point is at n = 1. There one has In(0) 
which diverges to negative infinity, so that U,(1) = 0. This behavior gives the phase boundary 
shown in Fig. 6.13(a). There are listed three phases where they exist alone. In the top-middle 
region there could exist either AF or PM. The ground state energy of the AF state must be 
calculated in order to compare the ground state energies of these two phases. The calculation 
of the properties of the AF state proceeds in several steps, since there are several quantities to 
determine. The first is the relationship between the occupation number n and the reduced 
chemical potential v*. The relevant expression at zero temperature is 


0 

n= | def@(v* — Ve + A?) + @(v* + Vx + AP) (6.264) 
-1 

A = udas (6.265) 


Recall that the energy is E = ,/ £? + (UAgs)” which when divided by W becomes yv x? + A’. 
The upper limit of integration is zero, rather than one, since the integral is over the reduced 
Brillouin zone which only goes halfway in energy. The two theta functions are the Fermi- 
Dirac occupation functions for the «, and B, operators. First consider the case that n < 1 so 
that v* < 0. Then the first theta function is always zero, while the second gives v* + 


V x2 + A? > 0 which can be solved for negative x to give 


—/(v*)? — A? > x (6.266) 
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FIGURE 6.13 (a) Preliminary phase diagram for the Hubbard model. Region with AF and FM together is to be 
determined. (b) Final phase diagram for the Hubbard model. 


so that the integral for the occupation number gives 


n=1—,/(v*)? — A? = (6.267) 

v* = —/A*+(1—ny (6.268) 

This equation provides one relation between v*, A and n. A similar derivation for n > 1 gives 
n=1+,/(v*)? — A’ (6.269) 

vt = /A*+(1—ny (6.270) 
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The second step in the derivation of the AF state is to solve the integral (6.237) for the gap 
parameter A. In dimensionless units it is the equation 


0 dx x / > x Za 
| aloo + x7 +A )- O(v — x7 +A )] 
n—l 


In(x +V +A? ) (6.271) 


A2 Oax AL 
_¥ (ve te V UM (6.272) 


2 vA +1-!1 


The upper limit of the x integral for v* < 0 is (n — 1), as shown in the derivation of the 
occupation number. Solve the above equation by defining S = exp(2/u), and then expo- 


nentiate the equation. Also define y = v 1 + A’ 


JA? +(n—-1 =SQV—-1)+(0—-n) (6.273) 


After squaring this equation, there is a quadratic equation for y, which is solved to obtain 


—S(1 — n — S) + |SU — n) — 1| 

= S 6.274 

y Yo] (6.274) 
If the factor of S(1 — n) — 1 is positive, then y= 1 and A = 0. In this case there is no 
AF state. This result is expected, since the criteria that S(1 — n) > 1 is equivalent to 
u < —2/ ln(1 — n). From the Stoner criteria, the system is not in the AF state for this small 
value of u. For S(1 — n) < 1 there is a solution with y > 1 which gives the order parameter 


y = VA? +1 = coth(2/u) — (1 — n)csch(2/u) (6.275) 


A? = {oth — fay 
sinh(1 /u) 


JA? + (n — 1)? = esch(2/u) — (1 — n)coth(1/u) (6.277) 


This nice result depends only upon n and u. It can be evaluated, and then the result used to 
find v* in Eq. (6.268). 

The third and final step in solving the properties of the AF state is to evaluate the integral 
for the ground state energy, again at zero temperature. In dimensionless units it is the integral 
in (6.231) 


(6.276) 


npk _ vera ]o(v tV 4A )} — u(n/2 + A?Ju (6.278) 
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The constant term A*/u has changed sign since Ags = —Agu- The integral can be evaluated 


easily by changing integration variables to & where x = —Asinh(&) and Vx? +A? = 
A cosh(&). 


n—l Éi 
l dxV x? + A? = a| dé cosh? € (6.279) 
—] n 
4 [é + 1 sinh(26)]? (6.280) 
— 2 2 En 
= A’ /u +}[V 1 +4? — (1 — ny (1 — n? + 4°] 
1 — n = Asinh(E,) (6.281) 
1 = Asinh(E,) (6.282) 


Collecting these results gives 


e4 = 77 +41(1 -AN1 ne +A A n HA A Mn + - NVI A 


(6.283) 


n+l E — [2 — n(2 — n)]coth(1 no) (6.284) 


u 
4 
The first expression is the direct result of the prior algebra. The second equation is found by 
using Eq. (6.276) for A° and doing more algebra. It has the advantage that the ground state 
energy is a direct function of the variables (n, u). This expression is the final result needed for 
the calculation of the phase diagram in the Bogoliubov approximation. For each value of n 
and u, one can calculate the order parameter A and also the ground state energy per electron 
& ,. For u > 2 the AF energy is compared to that of the FM state [fp = n(n — 1)] which is 
independent of u. The resulting phase diagram is shown in Fig. 6.13(b). For u > 2 the FM 
state occupies an increasingly large part of the phase space, for different values of n. The 
region of AF state gets shrunk to an increasingly small region near to n ~ 1. The reader is 
cautioned that these results are quite approximate, and are based solely on the Bogoliubov 
approximation to the initial Hamiltonian. Probably the AF phase does not extend down to 
zero value of U at n = 1, but instead there is a critical value of U, for the onset of the AF 
state. This feature is discussed in the following section. 

Figure 6.13(b) shows why the Kanamori theorem is perhaps reasonable. As U becomes 
large the FM phase approaches the line n = 1 on both sides. His theorem is that in the limit 
U — œ, then a bipartite lattice with N sites will be ferromagnetic if the number of electrons 
differs slightly from N. 


6.4.4. Local Field Corrections 


The homogeneous electron gas was discussed in Chapter 5. There the RPA was found to 
be a poor approximation for most quantities. It was necessary to add some additional 
correlation into the response functions, by postulating the existence of Hubbard local field 
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corrections: Gy(q) for charge fluctuations, and a similar function for spin fluctuations. A 
similar approach to the Hubbard model would suggest the RPA result is changed to 


Pq) 


— r 4— (6.285) 
1 -g5 P O 


%4) = 


(1) 
Xs(q) = 7w (6.286) 
1 +g 5 PO 


where P“!)(qg) is the usual polarization function of the electronic system, as calculated using a 
single bubble diagram without vertex or other corrections. A better evaluation of the polar- 
ization diagram would include vertex corrections. The present idea is to call these vertex 
corrections the constants g,,, which just multiply the correlation function. These vertex 
constants will be a function of U/W. This simple idea follows directly from the Hubbard 
approach. The question is how to calculate these vertex constants. 

A simple and clever way was suggested by Vilk and Tremblay (1997). They used sum 
rules on the correlation functions to provide formulas for the vertex constants. These sum 
rules are derived from the exact definitions 


tæi) =E | deToa DPA. 0) (6.287) 


. l a 
19. i9,) =~ | dret Tp a Doa 0) (6.288) 
0 


The sum rules are derived by taking a four-vector summation over these correlation functions: 
both over wave vector q and imaginary frequency q,. First do the summation over frequency. 
This step simplifies the expression, since the summation over frequency gives a delta function 


d(T) 


1 B> e'n" = (tT) (6.289) 
D Xe(q; i9,) = — = (pla 0)p,(—q, 0)) (6.290) 
3a iq,) = (p,a, 0)p,(—q, 0)) (6.291) 


The next step is to write the density operators in real space such as 


p.(q) = 2 [n +n let (6.292) 
p,(q) = Ir i — nj Jet ® (6.293) 
1 
y > (Pe(@)Pe(—@)) = =} in Hn (6.294) 
q 


y EOD) = 2 [r — nV (6.295) 
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These expressions are evaluated using Ne = Mjo to finally get 


l 
(nn) = N 2 (Ajn) (6.297) 
J 
1 
BN x Le(G, ign) = —[n + 2(n;n,)] (6.298) 
l 
BN de Xo(4 ign) = Ln — 2p) (6.299) 


These two results are close to the final formula for the sum rule. The one for the spin 
correlation is in its final form. A change must be made in the formula for the charge 
correlation. In the expression for the charge fluctuations, there is a contribution to the sum 
rule which is omitted in RPA. This contribution is the term with q = 0. Then the density 
operator is p(q, T) = N,, which is independent of t. This term gives a contribution which is 
N2 Sg—00n-0- It only contributes at zero frequency and at zero wave vector. It is usually 
omitted when considering fluctuations. However, this term is included in the sum rule which 
sums over all q and all n. Since it is omitted from RPA, also omit it from the sum rule. 
Rewrite the sum rule for the charge correlation to be 


BN & Ke ign) = =n + 217, — 1°] (6.300) 
q9, 


The two sum rules are Eqs. (6.299) and (6.300). The derivation is unfinished without an 
algorithm for determining the important correlation (n,n,). Vilk et al. (1994) argued that it 
was just g, = (n,n,)/({n,){n,)). The sum rule for the spin correlation is now just a self- 
consistent nonlinear equation for (n,n,). It is determined first. Afterwards, the charge fluc- 
tuation sum rule can be evaluated to give g,. In this way the two vertex constants g, , are 
found self-consistently. Results of Vilk and Tremblay for the two-dimensional sq lattice at 
half-filling (n = 1) are shown in Fig. 6.14. They plot the effective Hubbard constants 
U. = g,U, U,=g,U as a function of U, where the hopping energy t= 1. The charge 
interaction U. is significantly increased by the vertex interactions, while the spin interaction 


20 


0 = 
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FIGURE 6.14 U, and U, for the Hubbard model on the sq lattice as a function of U at half-filling (n = 1). From Vilk 
and Tremblay (1997) (used with permission). 
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U, is significantly decreased. Similar changes occur at other values of filling n. Obviously, 
this change in the effective interaction changes the locations of the phase boundaries. 

Their theorem about the lack of an FM phase is based upon these types of calculations. 
As U -> œ the effective U, saturates at a small value, which is less than needed for the onset 
of the FM phase. 

They also calculate the self-energy of the electron including the scattering by charge and 
spin fluctuations 


alp) = Uns — ggg È Weta) + UDO +9) (6.301) 
q 


where a four-vector notation is used, as usual. The summation is over wave vector q and 
frequency q,. Note that only one vertex is dressed, so that the interaction with the charge 
fluctuations is UU,, rather than UŽ. The latter would be overcounting the vertex corrections. 
They find this simple analytic self-energy function agrees quite well with numerical results 
obtained using quantum Monte Carlo simulations. 


Problems 


1. Consider the bound states of the Kondo Hamiltonian for a localized spin one-half particle 
interacting with a single particle in a band of continuum states: 


H = Y 8koCkoCko = È SCE, Cer — Ch, Cey) +I (Chh Ce, SOP + Ck Cirp S] (6.302) 
GS kk’ 


where S®, S and S refer to the localized spins. Show that the following are exact bound states of 
H: . 


>, = Ya Cis | t) (6.303) 
b) = LDC iH) (6.304) 
3 = Yale! 4) = CLI] (6.305) 
b4 = D elC Ly + Ch 14] (6.306) 


where a,, bk, d,, and e, are coefficients and |1), | |) are configurations of the localized spins. Find the 
eigenvalue equations for each case, and identify the spin triplet and singlet states. 


2. Evaluate the integral A(e) in the Kondo model for each g(¢) and show that the coefficient of the 
term ln(e/W) is proportional to p(0). Also verify that the integral of g(e) over the entire band 
(—W <eé < W) is unity. 

g(e’) 
€ —E 


0 
A(£) = l , dé’ (6.307) 


(a) g(e)=1/2W) 
(b) g(s) = 2VW?2 — &2/(nW?) (6.308) 
(c) g(e)=(W — |e|)/W? 


3. Construct the zero-temperature collective state for the Kondo Hamiltonian when the local spin has 
S = 1. Find the states with j = 3 and j = 4. Which is preferred for J < 0? 
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4. Consider the collective state for the antiferromagnetic Kondo model when the bands are half-filled. 
Calculate A = —dE,,/W by solving (6.48) using the density of states in part (a) and (c) of Problem 2. 
Make a graph of A vs A defined in (6.52). Which case predicts the lowest binding energy? 


5. Assume that an f level is split by the spin—orbit interaction into states with degeneracy 8 and 6, 
separated by an energy € = 0.3 eV. Derive an expression for 56 = g; — dE assuming that 6 < €. 


6. Derive the high-temperature (SC ~ 1/T) and low-temperature (6C ~ T) limits of the specific heat 
per impurity in the Anderson model. 


7. Derive some vertex corrections to Eq. (6.113) by taking n’ =n+1. Can you draw a Feynman 
diagram for these processes? 


8. Derive the energy current for the Anderson model. Then use it to derive the expression (6.122) for 
the Seebeck coefficient. 


9. Calculate U and W for metallic aluminum. Estimate the bandwidth W ~ Ep. Calculate U using a 
charge density for the 3s atomic state in aluminum of p,,(r) = Nr“ exp(—ar), where « = 2.8a.u. What is 
the ratio of U/W? 


10. How large a negative number can you make v, = v(q) + U/2 for the square lattice in two 
dimensions? Assume that the charge distribution on each atom is a Gaussian of the form 
o(r) = exp(—r? /b)/ (nb?) and vary b to find the largest negative value of v, at the corner of the 
Brillouin zone. 


11. Evaluate the function P“(q, 0) for the first-neighbor tight-binding model of the two-dimensional 
square lattice. Choose q to be the corner of the Brillouin zone Qa = n(+, +). Show that for this case that 
e(k + Q) = —e(k) for all values of wave vector. Then show that P“)(Q, 0) is infinity at zero temperature 
when the band is exactly half full. In this case the Stoner criteria is satisfied for any value of U. 


12. Use RPA in the extended Hubbard model to derive expressions for the susceptibilities x, | (q). 


13. Find P(g, œ) for the jellium model of the free-electron gas, in the limits: (a) x = 0, and (b) x 
becomes large, where x = 0/(qv;). 


14. Evaluate the charge fluctuations in two dimensions using (6.155) in the limit of large x. For d = 2 
the small wave vector limit is v(q) —> 2me*/(qgA,) where A, is the area of a unit cell. 


15. For the pure Hubbard model (v(q) = 0): 


(a) Prove there are no RPA terms in the electron self-energy with two bubbles. 
(b) Start from the all-spin form in (6.175) and show that there is a term such as U?P’. 
(c) Find the exchange graphs which cancel the term in part (b), in order to agree with (a). 


16. Derive the energy band dispersion e(k) for the hc lattice in the tight binding approximation. 
Include only first neighbor interactions, which are assumed to be isotropic. Find the reciprocal lattice 
vectors, and the shape of the Brillouin zone. What are the minimum and maximum values of ¢(k). 


17. Derive the energy band dispersion for the pt lattice in the tight-binding approximation. Include 
only first neighbor interactions, which are assumed to be isotropic. Find the reciprocal lattice vectors, 
and the shape of the Brillouin zone. What are the minimum and maximum values of e(k). 
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18. Use the order parameter 4,;(Q) to solve for the AF phase of the sq lattice. In this case, the 
effective Hamiltonian is 


H = 2 [e(k) + Unslnko + 2 e(k + Q) + Unslnkos -U 2 Aoa (®IC} oo Ck + Cio Cros] 
Oo oO ko 


(6.309) 


where k is summed over the AF Brillouin zone. One should get the same expression for the gap equation 
as found using the order parameter A,q. 


19. Derive a theory for the AF state in the hc lattice, where there is no Q vector since the Brillouin 
zone does not change size in going to the AF state. The first step is to choose a suitable order parameter. 


20. Assume the density of one-particle states has the form p(s) = (2/nW*)/ W? — £?. Use Bogoliu- 
bov theory to find the phase boundary between the PM and FM states. 


21. Use the density of states of the prior problem to find the boundary of the AF phase using Stoner 
theory. 


22. A single sheet of graphite has the honeycomb lattice which is bipartite. The sigma bonds are fully 
occupied by electrons. The n-bonds are half full which makes it a conductor. There is one electron per 
site when undoped. The density of states vanishes at half-filling. Assume it has the form: 


for -W <E<W (6.310) 


PÆ) -E 


(a) Derive the relation between D and W assuming the band holds two electrons. 
(b) Find a formula for the chemical potential as a function of filling u(n). 
(c) Derive the Stoner criteria for the onset of the FM state U.(n) in the Hubbard model. 


23. Derive the Vilk-Tremblay sum rules for the extended Hubbard model. 


24. Solve the Hubbard model approximately by taking 
Ho = -u 2 CoCo + Ud ninj 


V=-t>°C'. C 
jso 


j+8,o “jo 


Calculate the Free energy to second order in V which are the terms (Q9, 2,, 2). 
(a) Use perturbation theory to evaluate the free energy of the Hubbard model to O(U?). 
(b) Next show that the double site occupancy can be found from 
— 10 
=- NaU 
(c) Use the result of (a) to show that the above average behaves as (n/ 2)°[1 — U/ Uc] and estimate 
the critical value U.. 


(njin) 


Chapter 7 
Electron—Phonon Interaction 


7.1. FROHLICH HAMILTONIAN 


The Frohlich Hamiltonian describes the interaction between a single electron in a solid and 
LO (longitudinal optical) phonons: 


H = oop pol po + Oy X alag +o | Chiao Cho (ag +a‘) 
q 


Arah (ho) p? 
M. = _P 7.1 


ef(m\'*/1 1 
_& jt : 7.2 
=F (ae) Ea) 72) 


This Hamiltonian was derived in Chapter 1, with the form of the interaction given in Sec. 
1.3.5. The LO phonons are usually represented by an Einstein model, 1.e., the phonon 
frequency @) = @,0 1S taken to be a constant. Since there is a single electron, the Hamil- 
tonian may also be written as 

B p i My PA Bi 

H= pt O02 aat g 

where r and p are the conjugate coordinates of the electron. The unperturbed electron is taken 
to have free-particle motion with an effective mass m. Since there is only one electron in the 
problem, the results are independent of the statistics of the particle. The same results are 
obtained for any fermion or boson in the solid, such as holes, positions, etc., as long as they 
are free to move. The phonon modes are unaffected by the one electron in the solid, so the 
phonon self-energy is zero [actually of order O(1/v)], and the phonon Green’s function 2 is 
always J. The model also assumes that the motion is isotropic in direction and that the 
energy bands of the solid are nondegenerate. These rather restricted conditions describe what 
is called the Fröhlich polaron problem. The model actually applies in some cases to 
conduction bands of semiconductors and ionic solids which have their minimum at the I 
point and have an isotropic effective mass. For other symmetry points in the solid, one usually 
has to improve the model by taking into account anisotropy in the effective mass or degen- 
eracy of the bands. The latter is important for holes in semiconductors. The Frohlich polaron 


(a, +a") (7.3) 
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problem was an important problem in mathematical physics during the 1950s. Numerous 
mathematical techniques were tried out on this problem. Several are described here: Bril- 
louin—Wigner perturbation theory, Rayleigh—Schrédinger perturbation theory, strong coupling 
theory, and linked cluster theory. The Green’s function method will be shown to be equivalent 
at zero temperature to Brillouin—Wigner theory. Several other methods were tried, including 
that of Low et al. (1953). The problem was most accurately solved by Feynman (1955). He 
introduced a variational method based on path integrals. After lengthy algebra, he obtained a 
result which even today is the best available. His theory is very lengthy and is not presented 
here. The Feynman results will, however, be used as the standard to which other theories are 
compared. The methods not discussed here, such as that of Low et al., give poor results when 
compared with Feynman’s method. 

The polaron Hamiltonian (7.2) describes the motion of a particle while it is linearly 
coupled to a system of boson particles. In the Fröhlich Hamiltonian, the bosons are optical 
phonons in a polar solid. The classical picture has the particle exerting forces upon the ions, 
which respond and move. The ion motion creates new forces which act back upon the 
particle. The nonzero ion frequency makes the reaction forces, of the ions on the particle, 
retarded in time. The quantum nature of the phonons makes these forces occur in discrete 
units. In both the classical and quantum pictures, the ion motion is pictured as a polarization 
of the surrounding medium by the particle. The particle must drag this polarization with it 
during its motion through the solid, which affects its energy and effective mass. 


7.1.1. Brillouin-—Wigner Perturbation Theory 


The Brillouin (1932, 1933)-Wigner (1935) perturbation theory method is a historical 
predecessor of the modern Green’s functions method. It is equivalent to solving the equation 
for the energy spectrum E, of a particle of momentum p, 


p = Ep + Relè, elp, Ep)] (7.4) 


where È «(p, E) is the retarded self-energy. The problem is isotropic, so the functions depend 
only upon the magnitude of the wave vector. 

The method ignores the imaginary part of the self-energy of the electron. The particle 
properties are actually described by the spectral function: 


1 
A(p, E) = —2 in| a (7.5) 


If the imaginary part of the retarded self-energy is zero, it is replaced by an infinitesimal value 
id, and the spectral function just becomes a delta function, whose argument is (7.4). The 
Brillouin—Wigner method is exact if the imaginary part of the retarded self-energy is zero. 
Then if the retarded self-energy function Łe is found exactly, the exact result is obtained for 
the particle motion. 

The self-energy function cannot be obtained exactly unless the Hamiltonian can be 
solved exactly. This result has not yet been achieved for (7.2). In practice, one usually 
evaluates a few terms in the perturbation expansion and thereby obtains an approximate 
DLret(P, E). Equation (7.4) is solved with this approximate self-energy, which gives an 
approximate solution. This procedure is Brillouin—Wigner perturbation theory. 

For one particle interacting with a set of optical phonons, the imaginary self-energy does 
vanish at zero temperature. This statement is only true for particles whose kinetic energy is 
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less than the phonon energies @ . For a particle of initial energy E; to emit a phonon, it must 
go to a final state Ey = E; — Mp. However, if E; is less than @ , this equation cannot be 
satisfied since the energy Æ; cannot be below the bottom of the band. The process is forbidden 
by energy conservation. Of course, a particle can absorb a phonon and increase its energy to 
E; = E; + @. This step is always possible as long as there are phonons in the system. But the 
number of phonons is proportional to the thermal occupation factor, 


l 


No = epo% — 1 


(7.6) 
which vanishes as T — 0. At T = 0, the low-energy particle can neither absorb nor emit 
phonons. Since these are the only two loss mechanisms, the particle cannot lose or gain 
energy. Its mean-free-path is infinite, and the imaginary part of the retarded self-energy is 
zero. Brillouin—Wigner perturbation theory is exact at T = 0. Of course, it is exact only if the 
retarded self-energy is found exactly. In practice, that is impossible, and Bnillouin—Wigner 
perturbation theory is usually a poor approximation. 

The self-consistent energy function E, is a smooth function of momentum at small 
values of p. It can be expanded in a power series: 


2 
E, = Ep + 5 -+ O(p') (7.7) 
The quantity E, is the downward shift of the band minimum from polaron effects. Eo is 
negative and gives the amount an electron with zero momentum lowers its energy by inter- 
acting with phonons. In Sec. 4.3 the self-energy of a fixed particle is -Ja MG /@q, but it will 
be different now that the particle can move. The quantity m* comes from the coefficient of the 
p? term in the momentum expansion. It is called the effective mass. In solids there are several 
effective masses. Each energy band will have its own curvature at the band minimum, which 
defines the effective band mass m,. The band mass should be used for m in (7.2) and (7.3). 
The “b” subscript is omitted, but m is identical to m,. The effective mass m* in (7.7) is that 
resulting from the band curvature and polarons. It expresses the way that m (i.e., m) is 
changed by the polaron interactions. A formula for m/m* was given previously in (3.160). 

The first term in the perturbation series for the self-energy has one phonon in the self- 
energy diagram. The Feynman diagram is shown in Fig. 7.1(a). From the rules for 
constructing diagrams, the self-energy is 


Me 1 
IO, ip) = BL aL GM +A Pe + iI) IG: idm) (7.8) 
q 14m 
The Matsubara summation was given in Sec. 3.5: 
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FIGURE 7.1 
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At zero temperature, Ny = 0. The electron occupation factors np are all zero if there is only 
one particle in a band. Set ip = E — u + ið so the real part of the retarded self-energy is 
(v > œ) 


2 3 
REQ, £)] = 59 e (7.10) 
Ory J q E—@ —(p+q)/2m 
This result is also easy to obtain from the zero-temperature techniques of Chapter 2. The 
energy E is now measured from the bottom of the conduction band. A free particle has E = 0 
when it has no kinetic energy. The wave vector integrals may be done exactly. Results are only 
given for E < @,. The formulas at nonzero temperature, for all values of E, are given else- 
where (Mahan, 1972). Let v = cos(9) = p - q be the angular variable. The three-dimensional 
integration is d?°q = q°dqdvdọ and the integral f dọ = 27. The two remaining integrals are: 


(1) My [S f l 
RDE D=- a| dv — 
PP D= A A a o EFO F aE p/m 
The integrand remains unchanged if g —> —q, v —> —v. The answer is unchanged if the q 
limits are altered to (—oo, oo) and the result is divided by 2; the two intervals (—oo, 0) and 
(0, oo) contribute equally. The integral can be simplified by changing the q variable to 


x= ve (7.11) 
M3 /2m [” 1 
Re[£ (p, EJ] = -= | ax | dv ——————— 7.12 
The x integral can be done first. The integral is of the form 
OO 
j = (7.13) 
or +tA SA 


where A represents everything else in the denominator of (7.12). The integral is nonzero only 
when A > 0, which is always the case when @,) > E. The remaining integral is just an arcsine: 


1 
2 € 
| Ot — 2 (7.14) 
-ilo -E +e LN s o- E+E, 
3/2 E 
Rel ED (p, E = -EL sin | — 2 7.15 
e[ ret (p )] J& sin Wy _ E 4 Ep ( ) 


Equation (7.15) is the result for the one-phonon part of the retarded self-energy when E > @ọ. 
The parameter E, is obtained from the self-energy at p = 0. By using the expansion for the 
arcsine, sin™! (x) = x + x3/6 + O(x>), the p > 0 limit is 


Re{X“(0, EJ] = -——.— 
el ret ( )] /@) — Eo 


Equation (7.16) is the result for a zero-momentum particle. The result for a fixed particle is 
quite different. In Sec. 4.3 it is shown to be 


(7.16) 


La (7.17) 
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The integral does not converge unless some cutoff is used at the high values of momentum. 
Of course, the cutoff could be the size of the Brillouin zone or alternately the spatial extent of 
the localized particle wave function. No cutoff is needed in the free-polaron problem because 
of the recoil energy of the electron. The zero-momentum result is different from the result for 
a fixed particle, a fact which is easily understood from the classical picture. The ions in the 
vicinity of the electron get either repulsed or attracted to the electron. Their motion changes 
the potential field felt by the electron. If the electron is fixed, it does not respond to this 
change. The ions seek their new equilibrium position without any change in the potential 
exerted upon them by the electron. A free electron can respond to the changing potential of 
the ions. When the ions move in response to the potential exerted upon them by the electron, 
the electron in turn begins to move in response to the changing potential of the ions. The final 
solution describes the coordinated motion of the electron and ions. For a free electron, the 
recoil of the electron makes polaron theory harder. A free polaron of zero momentum does 
not mean the electron is fixed. It is constantly moving, in response to the continual interplay 
with the force field of the ion. For p = 0, the motion has, on the average, zero momentum. But 
the electron is moving in a random, stochastic fashion. When it has nonzero momentum, it 
still has this random part of its motion, even as it is drifting. The total polaron momentum 


P=p+) qalaq (7.18) 
q 


commutes with the Hamiltonian and therefore is a constant of motion. Momentum is an 
acceptable eigenvalue to assign to the polaron. 

The imaginary part of the one-phonon self-energy may also be obtained from (7.9). With 
ip, > E — and np = 0, the imaginary part is 


mM? fæ 
-im[E(p, £) = | q 


Or P {N SCE + Wo — Egg) + (No + LOE — Wp — Epiq)} (7-19) 
There are two terms in the brackets. The first term corresponds to the absorption of a phonon 
by the electron. This term is proportional to the phonon number density No, which vanishes at 
zero temperature. The second term comes from the emission of a phonon by the electron. It is 
proportional to the factor Nọ + 1, which is nonzero even at T = 0. However, this term is 
nonzero only if E > œ, so that the particle has more energy than the phonon, and is zero 
when E < @ . Therefore, Im[x? (p, £)] = 0 at T = 0, unless E > œọ. The latter is possible in 
a high electric field. 

The Tamm—Dancoff (TD) approximation constitutes solving Brillouin—Wigner pertur- 
bation theory with only the one-phonon self-energy: 


E, = £, + Re[Z Q (p, E,)] (7.20) 


For zero momentum, the TD approximation gives the particularly simple equation 


woe” 
Eg = — ——- 7.21 
=a (7:21) 
The above is a cubic equation for Ey, which is easily solved. These results are summarized in 
Table 7.1 for several values of «x. They are labeled Erp. 
The other columns are the Feynman results Ep and also the results of Rayleigh- 


Schrödinger perturbation theory EQD, EQ) is also evaluated in the one-phonon approximation 
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Table 7.1 Ground state energy of polaron in various 
theories: Feynman (Ep), Tamm—Dancoff (Erp), and 
Rayleigh—Schrédinger (ES) for n phonons. 


x EF /® Erp/®0 Exe/ Wo Exe/ Wo 
1 —1.013 —0.76 —1.00 —1.016 
2 —2.055 —1.31 —2.00 —2.064 
3 —3.133 —1.80 —3.00 —3.143 
to provide a fair comparison. This equation is very simple, EQ = —A&M,, as will be shown 


below. EY also includes the two-phonon terms. The table shows that the Tamm—Dancoff 
approximation is a poorer approximation than Rayleigh—Schrédinger. It provides the larger 
error when compared with the Feynman result, particularly for values of x ~ 1, which is 
called the intermediate coupling regime. 

The Tamm—Dancoff gives poor results in the intermediate coupling regime. The poor 
answers occur because a gap was introduced in the excitation spectrum. The energy 
denominator in (7.10) has the difference between the initial energy of the particle £ and the 
value in the intermediate state Ep+q: However, E has the minimum value E, < 0, while Ep+q 
has the minimum value of zero. In order to get to the excited state, in this approximation, 
there is an excitation energy of Eọ. Tamm-Dancoff is a poor approximation because there 
really is no excitation energy in the spectrum. The Tamm—Dancoff approximation just 
happens to insert a gap, which explains why it gives poor results for intermediate coupling. 

The above results are suitable for a single particle in a band. A different approach must 
be used if there is a small but nonzero density ng of particles in the band. Then one must 
consider the change in the chemical potential ou. In insulators, the chemical potential u has a 
negative value if the band minimum E£,, is defined as zero energy. If nọ is the density of 
particles, and A is the deBroglie wavelength, then for the noninteracting system with 
Maxwell—Boltzmann statistics one has that u — E„ = kgT In(m A). u — E,, is negative if the 
argument of the logarithm is less than unity, which happens when nọ is small. If polaron 
interactions cause a change òE, in the energy of the band minimum, and if nọ is unchanged, 
then the chemical potential must change according to du = ðE. 

In the polaron problem, E,, = 0 and dE, = Ey. How does op enter into the calculation? 
In going from (7.9) to (7.10), there was an analytic continuation ip —> E — u — òu + io. 
Effectively E is replaced by E — dy in all of the following equations. Since du = Ep, (7.16) is 
replaced by 


Re[Z\./(0, Ep)] = -200 (7.22) 


The right-hand side no longer depends upon Ep. The change in band minimum is Ey = —aW. 
This result now agrees with the Rayleigh—Schrédinger formula, which is given below. It is in 
much better agreement with the Feynman result than is the Tamm—Dancoff approximation. 
Brillouin—Wigner perturbation theory does better when more terms are included in the self- 
energy expression. The anticipated improvement is difficult to test since the next terms are 
quite formidable. 


7.1.2. Rayleigh-Schrodinger Perturbation Theory 


The Rayleigh—Schrodinger form of perturbation theory is the standard kind which is 
described in quantum mechanical textbooks such as Schiff (1955). It is also called on the 
mass shell perturbation theory. Energy and momentum are no longer separate variables. In 
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evaluating X(p, E) the energy E is set equal to €,, so the self-energy is just a function of one 
variable p, or, equivalently, ¢,. Of course, if the imaginary part of the self-energy is actually 
zero, then energy and momentum are uniquely related. Rayleigh—Schrédinger is an exact 
procedure under the same conditions that Brillouin—Wigner perturbation theory is valid—a 
zero value for the imaginary self-energy. The discussion is confined to the case of zero 
temperature and £, < © for which Im[2] = 0. 

The two perturbation theories have a direct link to the potential scattering theories of 
Sec. 4.1. The Rayleigh—Schrédinger perturbation theory is analogous to the reaction matrix 
theory of potential scattering, while Brillouin—Wigner is analogous to 7-matrix theory. In the 
former, one calculates real quantities on the mass shells, i.e., with E = g,. In the latter, one 
calculates a complex quantity for a general value of E. Recall that the reaction matrix result 
was not equal to the real part of the 7-matrix result. Similarly, the Rayleigh—Schrédinger 
electron—phonon self-energy of the electron is not found from the Brillouin—Wigner form by 
just setting E = £, and taking the real part. This happens to work for the one-phonon self- 
energy but not when higher-order terms are included. 

Successive terms in the perturbation theory for the self-energy function are called 
EO (p). The superscript denotes the number of phonons in the self-energy term. The self- 
energy is real, and the signature “Re” before the self-energy is omitted. The one-phonon self- 


energy is 


M? fq | 
IRM =S [H (7.23) 
(20) J I Ep — Ep+q — Wo 
The energy denominator contains the difference between the initial state energy €, and the 


intermediate state energy €,,, +o; the latter has one phonon excited. The summation over 
wave vectors is a summation over all intermediate states subject to momentum conservation. 
The self-energy is the same as the Brillouin—Wigner result (7.10) after replacing E by €,. This 
identity is true only for the one-phonon self-energies. For higher orders, in order to get 
Rayleigh—Schrédinger from the Brillouin—Wigner, replace E by £, and add some additional 
terms. 

The wave vector integrals in (7.23) are elementary. Indeed, just take (7.15) and replace E 
by €,, with the result for €, < @ 


3/2 g_\ 1/2 
ELp) = 220 gin! (=) (7.24) 
v€ 20 
In the limit where €, — 0, the simple result is: 
>8(0) = —4W (7.25) 


It is no accident that the self-energy is just —a@,). The definition of the constant x was chosen 
to obtain a simple formula for the self-energy. 
The polaron effective mass is derived by expanding the arcsine in a power series in 


J Ep/@o: 


le ey 
Ep) = —am)| 1 +>" + 0| 2 . 
RSP) | + 6m + o, (7.26) 
1 €, 4 
E, = € — &M_{ 1 + ba. + O(p’) (7.27) 
0 


= —a0) +8,(1 — 5) + O(p') (7.28) 
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The effective mass is m* = m/(1 — «/6). The polaron effects make the particle appear to be 
heavier than the band mass m. The extra mass arises because of the interaction between the 
electron and the ions. The electron causes a change in the equilibrium positions of the ions in 
its vicinity. The change in equilibrium position arises from the mutual interaction between the 
electron and ion, which was mentioned earlier. When the electron moves with a momentum p, 
it must drag this ionic deformation with it. It takes energy to move this deformation. The drag 
is what causes the mass to increase as the polaron coupling constant is increased. 

In the Tamm—Dancoff approximation, the formula (3.160) gives the effective mass 
m* = m(1 + «/2)/(1 + «/3). The details of this calculation are assigned as a problem. This 
result agrees with the Rayleigh—Schrédinger result at small values of «, since both formulas 
are proportional to m* = m[1 + «/6 + O(a)]. However, they behave quite differently in the 
intermediate coupling regime. The Rayleigh—Schrédinger result predicts that something 
calamitous happens at a + 6. A similar catastrophe is not implied by the Brillouin—Wigner 
formula, which is well behaved for all values of x. The Rayleigh—Schrédinger result is better. 
Something quite important does indeed happen at « ~ 6: The particle becomes localized. This 
statement will be proved in the next subsection on strong coupling theory. It has already been 
remarked, in connection with Table 7.1, that the zero-momentum values are much better in the 
Rayleigh—Schrodinger picture. 

At zero temperature, the two-phonon self-energy i in the Rayleigh—Schrédinger theory is 


1 1 
Zrs(P) = oy — || Mp 
a dia 4142 \€p — Ep+q, — 20/ | Ep — Ep+q: +q, — 220 


1 1 
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Ep B Ep+q, — Bo Ep 7 Ep+q, — Oo 


-o 7.29 
(Ep — Ep+a, — z (72) 


The wave vector integrals have been evaluated analytically for zero momentum. The integrals 
are complicated, with the result: 


3 | 
(0) = -0 [a(i + 5) -5| = —0.1059e a 7.30 
This result is combined with the earlier result for the one-phonon self-energy. The three- 
phonon self-energy was evaluated numerically by Smondyrev (1986), who computed the next 
terms in the Rayleigh—Schrodinger series for Ey and m*: 


Ey = —@p{a + 0.0159a7 + 0.000806% + O(a*)] (7.31) 
” L14 - +0.02362702 + o(a3) (7.32) 


Some values are tabulated in Table 7.1 and agree well with the Feynman values. The exact 
ground state energy is a power series in a. Just the first three terms in this series are known. It 
is remarkable that the coefficients of the « and « terms are so small. Even for intermediate 
coupling values 1 < « < 6, the term 0.0159a? makes less than a 10% contribution to the total 
value of Ey. Another way to say this is that first-order perturbation theory is a good 
approximation even for intermediate coupling strengths. It is further evidence for the 
superiority of Rayleigh—Schrédinger over Brillouin—Wigner for the zero-temperature polaron 


Sec. 7.1 e Frohlich Hamiltonian 441 


problem. It is assumed that higher-order terms also have a small coefficient and will also not 
contribute much toward FE, for intermediate values of coupling constant. 

It is natural to ask why the Rayleigh—Schrédinger method is so good. This point will be 
explained in detail in the next section. But it seems worthwhile to summarize some of these 
findings here. The independent boson model for a model system of electrons and phonons 
was solved in Chapter 4. The polaron model may not be solved exactly, but the Rayleigh— 
Schrédinger perturbation theory is related to the independent boson model. The self-energy 
for the interacting system of electrons and phonons arises from terms in which various 
numbers of phonons are virtually emitted into intermediate states. The one-phonon self- 
energy in the Rayleigh—Schrédinger picture corresponds to the assumption that all the 
phonons are virtually emitted independently of the others as in the independent-boson model. 
The two-phonon self-energy describes the correlations between the virtual emission of pairs 
of phonons. That is, Ëp) contains the basic contribution from emitting one phonon, two 
phonons and all numbers of phonons. The next term go RS 2) describes the correlations 
between pairs of phonon emissions. The following term X}s(p) describes three-phonon 
correlations, which are not just pairwise correlations. The term 0.0159«” is small, apparently, 
because correlations are not important at intermediate coupling. For this reason, the O(a’) 
terms are similarly small. 

At first it seems surprising that pee (p) has the self-energy from emissions of all different 
numbers of phonons but without correlations. However, it is similar to our findings for the 
independent boson model. There the exact self-energy was — ` q M? /®q, yet this described a 
ground state which had a mixture of large numbers of phonons. If you take the Rayleigh— 
Schrödinger theory and set all kinetic energy terms equal to the same constant &), you 
immediately recover the independent boson model in which the particle had constant energy. 
The one-phonon self-energy becomes 


M2 1 1 M2 
sp) Mo _~_y 4 33 
RSP) V 2 q2 Oo 2 Oq (7 ) 


It is the exact self-energy when the kinetic energy is constant, which means that all higher 
self-energy terms are zero. An inspection of EO (p) i in (7.29) shows that it vanishes when all 
kinetic energies are a constant. However, it must vanish, since the exact self-energy for the 
independent boson model is just a linear function of a. 

The independent boson model was solved in Sec. 4.3 by a variety of methods. It was not 
solved using Dyson’s equation with all of its self-energy diagrams. At zero temperature, this 
procedure would correspond to trying to solve it by Brillouin-Wigner perturbation theory. In 
fact, the simple exact self-energy cannot be obtained this way—at least no one has ever 
succeeded in doing it. Apparently one has to evaluate all the terms in all orders of perturbation 
theory. It is not surprising that this approach does not work well for the polaron problem 
either. 

A lover of antiques will enjoy that, for the polaron problem, the old-fashioned pertur- 
bation theory works better than Dyson’s equations. One might ask whether this is a general 
feature. If so, why bother to learn about Green’s functions? A general rule cannot be given. In 
some Hamiltonians the Rayleigh—Schrédinger method is best, and in others Dyson’s equation 
is better. Coupled mode problems, such as solved for polaritons in Sec. 4.6, need to be 
described by Brillouin—Wigner perturbation theory. Another example: whenever the 
imaginary part of the self-energy is significant, it seems necessary to use a Green’s function 
approach. The old-fashioned techniques do not allow for imaginary energies in a systematic 
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manner. The theory of strongly coupled superconductors, for example, could not be done 
without Green’s functions. The actual message is that Green’s functions are not the best way 
to solve all problems. Which problems are best solved by Green’s functions? Unfortunately, at 
the moment, the only way to tell is by trial and error. No general rules are available which 
predict when one perturbation method is better than another. Of course, for the polaron 
problem, Brillouin—Wigner perturbation theory works fine as long as the chemical potential is 
renormalized along with changes in the band energies. 

The word polaron describes the coupled system of electrons and ions. The self-energy is 
calculated for the mutually interacting system of electrons and phonons. Although it is the 
electron self-energy, part of the energy resides in the ions themselves. The ions are moved 
from their equilibrium position by the presence of the electron. The displacement takes some 
vibrational energy, which is part of the electron self-energy, because it follows the electron in 
its journey through the crystal. A simple analogy is the charge on a spring, which was 
discussed in Sec. 1.1. The application of the electric field causes a deformation of the spring. 
The final self-energy —(eF y /2K contains a part which is the energy needed to compress the 
spring to the new equilibrium. In the polaron motion, the springs governing ion vibration 
occur locally wherever the polaron is at the moment. 

The classical picture is that the ions in the polar lattice deform around the electron. The 
quantum picture is the same, except that the motion of the ions is quantized. The number of 
phonons is discrete. The amplitude of the ion displacements may not have a continuous range 
of values but only discrete amounts, which are phonons. If the average number of phonons 
around the electron is large, the quantization makes little difference. A laboratory spring does 
not appear to be quantized, although it surely is, but the displacements which usually are 
observed are so large, with such large quantum numbers, that the quantum nature is irrelevant. 
The same is true with our polaron if it has a large average number of phonons. These phonons 
are called the phonon cloud. 

One may try to calculate the average number of phonons in the polaron. As with all 
things in polaron theory, it may only be calculated approximately. Since the Rayleigh— 
Schrödinger method is the best, it is used here. From first-order perturbation theory, the wave 
function of the electron at zero temperature is 


Ip’) (pV Ip) 
Pe) = |p) + ), ————— 
p 2 Ep — Ep — Wo 
1 | pss My 1 eilp+a) - ral 10) (7.34) 
vv VV a 1 Ep — Eptq — ®o 


where |0) is the phonon vacuum, so ai|0) has one phonon with wave vector q. The total 
number of phonons is found by taking the expectation of the phonon number operator 
>, alay With this state: 


Nu(p) = | Prin) 3 ala Y (r) / l PrP Y, (7.35) 


The symbol Nu is used for number, to prevent confusion with other N symbols such as the 
thermal average number of phonons. The number operator on the first term in (7.34) is zero, 
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and on the second term it is unity. The first-order Rayleigh—Schrédinger prediction is 


Nu(p) = À 


i - <a (7.36) 


A(p) = Mo yb Ã——— (7.37) 


V q q? (Ep ~ Ep+q4 — Wo) 
This wave vector integral is done by making the same variable changes that were used to 
obtain (7.12) [v = cos 8, x = (q + pv)/V2m| 


A(p) = Miwon |" Kal dy} (7.38) 
-1 (Wp 
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The two integrals give, in turn, the values for (œp > €,) 


°° 1 T 1 
| dx —_—__——,, = 37.37 (7.39) 
-o (© — Ve, +x) 2 (0p — ve) 
l l 2 l 
l dv 5 = (7.40) 
-1 (M@- v’é,) Wo ./Mo — Ep 
so the final result is 
1/2 
Oo Wo 
A(p) = 5 l 2) O(Wo — £p) (7.41) 


For a particle with zero momentum, the result is A = «/2, Nu = «/(2 + a). Nu(0) is always 
less than one, since the eigenstate only allowed for the creation of one phonon. It should be 
remembered that Nu(p) is an average number of phonons. The actual number must be an 
integer, which fluctuates from time to time about this average number. 

These results apply for €, < @ọ. An attempt to calculate the formula for £, > Wy gives 
the result infinity. This answer is nonsensical, but the question was also nonsensical. A 
particle with kinetic energy larger than @, will eventually emit a real phonon and lower its 
energy to €, — @ọ. The number of phonons about the electron is not a stationary quantity 
when £, > Wp. It is not reasonable to try to evaluate a stationary matrix element such as Nu. 

Another quantity to calculate is the mean-free-path of the electron. At zero temperature, 
it is infinity when £, <  . However, at nonzero temperatures, there is a nonzero probability 
No that some thermally excited phonons do exist. One of them may be absorbed by the 
electron, thereby changing its energy and momentum. According to the golden rule (Schiff, 
1955), the transition probability is 


_2n tle dq Ms 

(2r) q’ 
This formula is nearly the same as twice the imaginary part of the retarded self-energy given 
in (7.19). The main difference is the replacement of E by £p. The second term of (7.19), 
proportional to Nọ + 1, has been omitted. It gives the rate of phonon emission; it is zero if 


€p < Wg. The integrals in the above expression are evaluated by changing variables to dxdv to 
give 


—- Noo(Eq + O0 — Ep+q) (7.42) 


_ Mov 2m 
At 


1 
“Ng p dx E dvõ(@ + ve, — x°) (7.43) 
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The two integrals may be evaluated in turn, 


°° 1 


! l [ar e 


to give the scattering rate for phonon absorption. The scattering rate is the same as the inverse 
lifetime of the particle w = 1/t: 


VERENA 
2 aig o| eee O FE a4 (7.46) 


The mean-free-path /, is found from the classical expression J, = t,,v,, so that one multiplies 
T, by p/m. At zero momentum, the lifetime is evaluated as 


ji 
— = 240 )No (7.47) 
To 


This lifetime is used to calculate the electron mobility ų = ety/m* as discussed in Chapter 8. 


7.1.3. Strong Coupling Theory 


The strong coupling theory for polarons was invented by Landau and Pekar (1946). 
Their theory was the first work on polarons, which even preceded the word polaron. Their 
theory, and its subsequent improvements, is now known to be valid at large values of «: hence 
the current name of strong coupling theory. The method of calculation is radically quite 
different from the prior perturbation theories. It is basically a variational calculation on a 
Gaussian wave function. The calculation is done first and the physics is discussed afterward. 

The Hamiltonian (7.3) is rewritten so that the phonon operators appear as displacements 
Q, and their conjugate momenta P,, which are chosen so they are dimensionless: 


1 
Q= a + a'a), ag = Ze + iP) (7.48) 
] 
P, = -3a ~al) ay = (Oy Pa) (7.49) 
_P sp E Qa igs 
H= +5 L (Pa + Og) + Me ym vo (7.50) 


The wave function of the many-particle system ®(r; Q,) must contain the coordinates of the 
electron r and the ion displacements Q,. We make what appears to be a drastic assumption: 
the electron is localized with a Gaussian wave function. Later this interpretation will soften, 
and it is shown that the particle is not quite localized. But for the moment assume that the 
total wave function is a simple product of electron and phonon coordinates: 


Dr; Oy) = PY, + Qg) (7.51) 


B 3/2 p? 
o(r) = (E) exp (-57) (7.52) 


Sec. 7.1 e Frohlich Hamiltonian 445 


where P is a variational parameter. The phonon wave functions , are the usual harmonic 
oscillator wave functions, except that they are centered about an equilibrium displacement 
—6Q,, which also needs to be determined. The first step in the calculation is to take the 
expectation of the Hamiltonian over the electron part of the coordinates. For the two r- 
dependent terms the Gaussian wave functions give 


HO) = | Prot) (7.53) 
Jero? roe — >= = af (7.54) 
| Proce T= ve (7.55) 


The second integral is most easily evaluated in (x, y, z) coordinates. The expectation value of 
the Hamiltonian is 


H(Q,) = ° 3 + PLP y + Oy) +E Lala (7.56) 
q 
L=M dai (7.57) 


The next step is to choose the equilibrium displacement 6Q, so that the term linear in Q, is 
eliminated: 


50,=—* (7.58) 
0 
H (Oy) = 2 EIP} + (0, + 80 7438 4 (7.59) 
T2449 <4 a 4m 2a’ 
= Z DIP} + (Oy + 804)"] + E(B) (7.60) 
q 


The first term on the right describes the harmonic vibrations of the phonons about their new 
equilibrium positions —6Q,. They have the harmonic wave functions wy, in (7.51) and the 
eigenvalues (n + 5)@). The second term 3B”/4m is the kinetic energy of the electron in the 
Gaussian wave function. The last term is the potential energy of interaction between the 
phonons and electron. The parameter P is varied to give the lowest energy for these last two 
terms. First the potential term must be evaluated: 


> 1/2 
l Mè Bq e FB ? 
Eu | a (7.61) 
0 


(2n)° q? mT 


3B B o k 
E(B) = - a Fe ) (7.62) 


mT 
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Set B = B*/2m, the last two terms are 


20 1/2 
E(B) = 3B — VBu (=) (7.63) 
1/2 
dE ___% (2a)\" (7.64) 
dB * 2/B\ tt 


The variation shows that the minimum value is ./By = («/3),/2@)/m so that the minimum 
energy is 


E(B) = — = = —0.1060200, (7.65) 


As usual with bound state variational calculations, the potential energy is twice the kinetic 
energy, in agreement with the virial theorem. This result was obtained by Landau and Pekar. 
The energy is proportional to a rather than «. 

The physics behind the variational theory is simple. The electron mass is much smaller 
than the ion mass, so the electron moves much faster. Its motion creates a charge density, and 
the ions respond to this average density. The small polaron occurs because of a strong 
feedback loop. The ions move to create a local potential which traps the electron in a bound 
state. The extent of the bound state depends upon the average motion of the electrons. The 
ions displace in response to the average motion of the electrons. 

In strong coupling theory, it is assumed that « is very large and one tries to evaluate the 
energy as a power series in O(1/a). The philosophy is very similar to that for deducing the 
correlation energy of the homogeneous electron gas in Secs. 5.1 and 5.2. One perturbation 
expansion was developed for small r, and another for high r,. The same is done for the 
polaron. The Rayleigh—Schrédinger expansion is valid at small «, and the strong coupling is 
valid at large a. The interpolation between these two limits is remarkably easy. The best 
available result for the strong coupling limit (Miyake, 1976) is 


lim E)(«) = —@p[0.1085«? + 2.836 + O(1/a2)] (7.66) 
YK OO 0 


The coefficient of the «” term is 0.1085, which is remarkably close to the Landau—Pekar result 
of 0.1061. This agreement shows that the Gaussian assumption is very accurate. 

Figure 7.2 shows a plot of the ground state energy for Rayleigh—Schrédinger pertur- 
bation theory as the solid curve labeled RS; also shown by the dashed line is the strong 
coupling theory using (7.66). The strong coupling theory appears to predict a lower energy 
state for small values of «. This value is deceptive, since terms of O(1/«?) have omitted in the 
asymptotic series, and these obviously diverge at small «. The strong coupling theory results 
should be believed only for values of « above, say, about 5 or 6. The two curves almost touch 
for values of « ~ 5. The value « ~ 5 is believed to be the crossover region between the two 
theories. The correct theory is the Rayleigh—Schrédinger result for values up to « < 5 and the 
strong coupling theory for « > 5. Such a curve is precisely the behavior of the Feynman 
theory. 

How large in space is the wave function of the small polaron? From the minimal value of 


Bo, the value of Bo is 
a [4a om 
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FIGURE 7.2 Energy of a polaron with zero momentum as a function of coupling constant. The solid line is the 
Rayleigh-Schrödinger theory, and the dashed line is the strong coupling theory. 


Bo has the dimensions of meters~!, so its inverse is roughly the size of the localized wave 
function. For hœg % 0.03 eV and m an electron mass, the polaron size is Bp l ~ 40 A /a. For a 
values around 5 to 6, the size is 7 Å, or about the size of the atomic unit cell. The localized 
electron will be influenced by the fact that the ions are atoms. The initial Hamiltonian (7.3) 
assumes a continuum theory for the ions, which is probably a reasonable approximation for 
small values of a but surely fails when the polaron size is of atomic dimension. Strong 
coupling theory cannot be applied to real solids without additional modifications to account 
for the atomic nature of the phonons. From now on polaron theory for x > 5 must be 
considered a mathematical model with interesting properties. The useful piece of physics is 
that polarons become localized on atomic sites for values of « larger than 5 to 6. The Fröhlich 
Hamiltonian does not describe their behavior after they are localized. 

The theory of polaron localization has a catch. One needs to know the band mass m to 
calculate «. Today one can deduce m from a good energy band calculation. In the early days 
they seldom had the required accuracy. Instead, it was customary to measure the polaron 
mobility and the effective mass, and the combined results will yield m, m*, and a (see Hodby, 
1972). The catch is that this procedure works only if the polaron is mobile. After it becomes 
localized, its mobility drops precipitously. The effective mass of a localized polaron cannot be 
measured by cyclotron resonance. It is difficult to deduce what the polaron constant « should 
have been after the particle is localized. It can only be deduced accurately if the particle is not 
localized. There are no experimental values of x above 4. Perhaps this value is the cutoff for 
localization of the polaron. 
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There are several other properties worth mentioning for polarons in strong coupling 
theory. The first is that the localization may occur anywhere. If the phonons were really a 
continuum, then the Gaussian packet would drift about and have an effective mass. The 
effective mass is quite large: Allcock (1956) has estimated it to be m* = 0.0208ma‘*. For 
example, at « = 5 it is m*/m = 13. A similar estimate, m* = 0.0227ma", is given by Miyake 
(1976). The polaron is heavy because it has to drag with it the potential well of the phonons. 
The prediction from Rayleigh—Schrédinger perturbation theory is that the effective mass 
m*/m was proportional to 1/(1 — «/6) so that something important seemed to happen at 
a ~ 6. What happens is localization, which makes the polaron much heavier. The heavy mass 
is not measurable because, as we said earlier, strongly coupled polarons are a model which we 
cannot apply to real systems without modification. 

Another feature of strongly coupled polarons is that they have excited states which are 
also localized. For example, try to construct a polaron state which has p-wave symmetry. A 
suitable trial wave function might be 


5, 1/2 
(r) = (a) zebr (7.68) 


One could repeat the variational procedure with B as a variational parameter (see Problem 9 at 
the end of this chapter). It will be different from the s-wave value Bọ, and it will be smaller. 
The wave function is more spread out, and the electron is less bound. It is called the relaxed 
excited state (see Kartheuser et al., 1969). If the excited state were stable, one could observe 
actual optical transitions to this state from the ground state of s symmetry. The excited state is 
not stable, since it is degenerate in energy with the ground s state plus L phonons, where L is 
whatever number is necessary to make up the energy difference. Nevertheless, theoretical 
calculations show a sharp line in the optical spectra, which has been interpreted as this s- to p- 
wave transition. The strongly coupled polaron can create its own internal structure. 

The strong coupling limit is calculated in the adiabatic approximation (see Allcock, 
1962). The electron has sufficient binding energy that its oscillatory motion in the potential 
well is much faster than the vibrational frequency of the phonons. The phonons do not have 
time to adjust to the individual oscillations of the electron. Instead they adjust to the average 
motion of the electrons. The ions are treated as a rigid potential well, in which the electron 
adiabatically oscillates. A quite different picture applies to the weak coupling limit. There the 
phonon energy is larger than that of the electrons. The picture is that the phonons, or ion 
polarization, follow the electron during its motion. 


7.1.4. Linked Cluster Theory 


The linked cluster theory was advocated by Brout and Carruthers (1963) as a general 
method to attack many-body problems. It is surprising that the method waited so long to be 
applied to the Frohlich polaron. It was not done until the work of Dunn (1975), although some 
earlier work of Mahan (1966) was similar. The surprise is even greater when it is realized that 
this method is an excellent way of obtaining the Green’s function G(p, t) for the polaron in the 
weak and intermediate coupling regimes. In the prior section, it was concluded that the 
Frohlich polaron Hamiltonian is useful only for these coupling values. The linked cluster 
theory is applicable for all relevant coupling strength. One disagreeable aspect of the tech- 
nique is that the final numbers have to be generated on the computer, since the formulas are 
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too complicated to permit analytical evaluation. This work is the price one pays for good 
Green’s functions. 

The linked cluster methods were described in Sec. 3.6. A problem with one electron may 
be worked in real time. The real-time formalism is valid even at nonzero temperatures. The 
Matsubara technique is not needed since the interactions do not change the thermodynamic 
averages. That is, when writing the Hamiltonian as H = Họ +V, Hy can be used in 
exp(—B#), rather than H, since V does not affect the thermodynamic averages. The only 
averaging is done over the phonons, and one electron in a macroscopic solid does not alter the 
phonon energies. This substitution, of using H, for H in the thermodynamic averaging, is not 
appropriate in a many-particle system where the particles influence the phonon modes—as in 
a metal or a heavily doped semiconductor. It is acceptable in the one-electron Fröhlich 
Hamiltonian. For the phonon Green’s function at nonzero temperature, take 


DO = —i[(No + De!" + Ne] (7.69) 
1 
0 ebo — ] 


(7.70) 


where D© (t) has no q dependence, since the phonons are assumed to have no dispersion. In 
this case D(#) = D(A) since all the phonon self-energies are zero. The only concern is to 
evaluate the electron Green’s function. By adapting the formulas of Sec. 3.6 to real time, the 
Green’s function for the electron may be written as an exponential function of momentum and 
time: 


G(p, t) = G (t) exp[F(p, Ð] (7.71) 
G(p, N = iO (He! (7.72) 


The function F'(p, t) is generated as a series of terms, which are obtained by the following 
systematic procedure. The electron Green’s function 


G(p, t) = —i(0|Te™ C se C110) = iO T UC) 
is evaluated in the interaction representation: 
G(p, t) = —iO(t) } ` W,(p, £) 
(7.73) 


(— i” t t a a a a 

W,(p, t) = dti +- | dtan (TC p OV) V Ctan)Cio (0) 
(27)! Jo 0 

The S-matrix expansion generates a series of scalar functions W,(p, t). They are resummed as 

an exponential series of terms F,(p, t) 


G(p, t) = —iO(t) $ W,,(p, t) = —iO(t) exp — ie,t + 3 F, Q, J 
S "=l (7.74) 
F(p, t) = 2, Fa (P, t) 


These two series may be equated term by term by assuming that both W, and F, are 
proportional to powers of the coupling constant «”. That is, define W, to be all of the terms in 
the S-matrix expansion which are proportional to «”. Similarly, when the series is resummed 
as an exponential, F, contains all the terms proportional to «”. For the polaron problem, the 
terms in the S-matrix are zero when / is odd, since they contain an odd number of phonon 
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creation or destruction operators. Only the terms with even / contribute, and n = 2/. With this 
convention, the relationship between some terms in the two series are: 


Fi = er W, (7.75) 
| l 

F, = e W, — gi (7.76) 
l 

F; = e W — FF, — zE (7.77) 


The terms are listed only up to n = 3, since that is already beyond what has been evaluated for 
the polaron problem. So far only the first two terms have been computed. However, these 
appear to be adequate to describe the polaron Green’s function for low and intermediate 
coupling. The term W,, or equivalently F,, was first evaluated by Mahan (1966). This term 
has the form 


a2 pt 
mp) == 


| an | dy (TÊ, (OPPO) (7.78) 


Write the Frohlich Hamiltonian as H = H, + V, 


=Le €, CIC o tM >, alag (7.79) 
a 
A Mo e~ Ep—Ep+q) A 
PO = Cosa Cpe (0 (7.80) 
qpo 
A,(t) = age + at qe (7.81) 


When evaluating the correlation function in (7.78), the phonon operators just give the phonon 
Green’s function (7.69), and the electron correlation function can be written as a product of 
three G by using Wick’s theorem. The expression to evaluate is 


dq M2 {' i 
Wp.) = — |e P | dt | dt,D(t, — t) 
x GO(p, t— t1)GO(p + q. ty ~ t)GO(p, ty) (7.82) 


For one particle in a band, the electron Green’s function is G(p, ) = —iO(t) exp[—ie,t]. The 
above expression simplifies to 


dq Mè 
(2n)° g? 
x [(No + Le ios —4) + Nelo —2)] (7.83) 


, fi l 
W, (p, t) = e | mip dt, | dt, e!®r—®pra hi =h) 
0 
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The two time integrals may be done without difficulty: 


e™' W (p, t) = Fi, 1) (7.84) 
[ey Sbaieorates) 
(2m q | \Q_@,q) Qp. q) 
1 — e®- 1 — ets 
+ (Ny + 1I)=>—— + WM ~——] (7.85) 
Q (P.a pg 
Q (P, q) = Ep — Ep+q + O0 (7.86) 


The function F(p, ¢) is determined. It has several features which are immediately interesting. 
There is a linear term in ¢. Its coefficient is just Rayleigh-Schrödinger one-phonon self-energy 


dq x No + 1 No ) 
(2ny gq? \Q_(p.q) Q,(p, q) 


It is just the real part of the self-energy, since the principal part is taken of the energy 
denominator. The self-energy is appropriate for nonzero temperatures. The result (7.23), for 
zero temperature, is obtained by setting Ny — 0. The first term describes processes whereby 
an electron of momentum p emits a phonon of wave vector q and goes to the state p + q. The 
second term describes processes where an electron in state p absorbs a phonon of q and goes 
to p + q. The energy denominator contains +@, depending on whether the intermediate state 
p +q has one more or one less phonon than the initial state. The evaluation of these wave 
vector integrals is assigned as a problem. 

The energy denominator Q (p, q) can vanish during the q integration. It enters one term 
in F, as (Q D7, which gives a possible divergence. However, there is no real divergence. 
Collect together all the terms in Q,, including those from the self-energy, and find the 
combination 


Op) = | (7.87) 


1 
— {1 + iQ, — e+} (7.88) 
(Q,)° + 
which is well behaved when Q, — 0. There is no actual divergence in the wave vector 
integration. 
Most numerical work has been done for the momentum state p = 0. In this case F, (0, £) 
can be obtained analytically in terms of the Fresnel integrals (Mahan, 1966) (s = t@ ) 


dte“ 


V (0, i) = — zz Q + D| -ivs + (25 + Ne Bo 
+No se" + (2s — 5 Ey) l (7.90) 


The easiest way to derive this result is to follow the method of Dunn (1975). His suggestion is 
to go back to (7.83) and do the wave vector integrals before the time integrals ¢,, and f 


d’q M3 


t ti 
dt | dte DN, + Le MoO) 4 pyet] (7.91 
(ny | 1 | dh [Wo + 1) 0 ] (7.91) 


0 0 


F (0, £) =-| 
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The wave vector integral has the form, for (t > 0) 


(0 @) 
| dge tt /2m — m (7.92) 
0 2it 
Since the integrand is only a function of t, — t», change the integration variable to f = t, — b, 

8 y 17h 8 £ 17h 
and then change the order of integrations to get 


mM; dt’ / —i@, KO) 
F,(0, = EF | eta + de of 4 Nelo") (7.93) 


The time integrals are now recognized as just Fresnel integrals, or integrals related to them. 
The result (7.90) is derived easily. An approximate Green’s function is obtained when we 
approximate F(0, t) by its first term F, (0, t). The approximate expression can be numerically 
Fourier-transformed to obtain the spectral function 


A(E) = -2 in| (=) | are (7.94) 
0 


In this way, an approximation is derived for the imaginary part of the Green’s function. The 
real part may also be obtained. 

The accuracy of this approximate Green’s function is not obvious. About the only way to 
judge is to also evaluate the next term F,(p, t), which was done by Dunn. The next term 
W,(p, t) produces three different terms, which correspond to the three Feynmann diagrams in 
Fig. 7.3. Usually such Feynman diagrams are associated with Dyson’s equation, which is not 
the case here. Such diagrams are convenient to use, but here they have a different inter- 
pretation. For example, the diagram in Fig. 7.3(c) is not evaluated in Dyson’s theory, since it is 
just the square of the one-phonon term. This identity is not true in the linked cluster theory, so 
it must now be evaluated. In fact the theory is not one of “linked clusters” at all, so the name 
is inappropriate. All diagrams are evaluated. A more descriptive name would be something 
like exponential resummation, which lacks pizzazz. 

Some numerical results of Dunn (1975) are shown in Fig. 7.4. In Fig. 7.4(a) the spectral 
functions for p = 0 and kgT = 0.4@,) are shown for three values of x. Dunn found that for 
a < | the two-phonon results did not change the spectral function at all. The curves calcu- 
lated with F = F; + F, were identical with Mahan’s, which used F = F}. For higher values 
of « there were changes, but they were not dramatic. Figure 7.4(b) shows a comparison of the 
one-phonon and two-phonon spectra functions for « = 6. The theory is not applicable at this 
high a value, because one should be using strong coupling theory with its internal excited 
states. However, one can conclude that the linked cluster theory converges rapidly for low and 
intermediate coupling strengths. 

The spectral functions shown in Fig. 7.4 describe the Green’s functions for all 
frequencies œ. The low-energy exponential tail is for states below the conduction band 
minimum. The peaks at multiples of œ are states with different numbers of phonons asso- 
ciated with the polaron. These are similar to those found for the independent boson model, 


(a) (b) (c) 
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except that here the peaks are broadened by the recoil of the electron. For « = 2 the lowest 
two peaks are the main ones. But for « = 4 and 6 the lowest peaks have small amplitude, 
which shows the influence of the renormalization factor. 

The one-phonon term F is very similar in form to the independent boson model. 
Compare (7.84) with (4.257) and (4.259). The only difference is the substitution of 
OQ = £4 — Ep+Q + Wy in the independent boson model, instead of +a . F, is given the same 
interpretation. It describes the Green’s function when successive emissions of phonons are 
independent. The approximate Green’s function, with only F,, does describe an electron 
coupled to many different numbers of phonons—but all uncorrelated. The addition of F, 
included correlations between pairs of phonon emissions. F, has little effect on either the self- 
energy or the spectral function itself at p = 0. 

The linked cluster theory is quite different from the Dyson’s equation approach. There 
the one-phonon self-energy =‘) (p, E) adds only a one-phonon peak to the self-energy. The 
two-phonon peak comes from the two-phonon self-energy, etc. In order to get a full Green’s 
function, with many phonon peaks, one needs to evaluate many self-energy diagrams. These 
higher-order diagrams are difficult to evaluate and have never been calculated. The Dyson’s 
equation method has not been solved to the same accuracy as the linked cluster theory. Of 
course, one can also obtain G(p, t) from Feynman theory. 


7.2. SMALL POLARON THEORY 


Polarons become “small” when they become localized, as in the strong coupling theory. 
In contrast, Frohlich polarons are sometimes called “large.” Small-polaron theory assumes 
that the size of the polaron corresponds with atomic dimensions. It recognizes the periodicity 
of the solid and thereby assumes that the motion of the particle is no longer translationally 
continuous. Instead, it assumes that the particle, usually an electron, may occupy an orbital 
state b(r — R,) centered on atomic site R,. The orbital states are identical on each site, so 
there is periodicity. The particle may move from site to site, exactly as in the tight-binding 
model. The motion from site to site may be caused by the overlap, or nonorthogonality, of the 
orbitals on adjacent sites. The phonons are coupled to the particle at whichever site it is on. 
The following Hamiltonian describes the motion of small polarons: 


H=J 2 C! C + 3 Ogalag + Vo (7.95) 
J; 
QO =D Xe (a t+aly), X= — (7.96) 
q q 
Q, = TXabae (aq + ata) (7.97) 
V% =D Z GOO; (7.98) 
Hep = PEGG J O = Qs) (7.99) 


Vip) = Ds X Cis * (Q — Qs) (7.100) 
J 


Sec. 7.2 e Small Polaron Theory 455 


where C; is the destruction operator for a particle on site R;. The summation 6 is over the 
nearest neighbors. Spin indices are not important in this problem and are omitted. If (7.95) is 
compared with the Fröhlich Hamiltonian (7.2), only the first term is different. The present 
Hamiltonian (7.95), which includes the periodicity of the solid through the tight-binding 
model, is much more realistic. 

Three different forms are provided for the electron-phonon coupling Vep. The first one 
Vi) treats the ion displacement Q; as a scalar. This model is appropriate for organic solids, 
where the vibrational modes are those of the molecule. As an electron hops from molecule to 
molecule, it can excite vibrational modes of the molecule it is resting on. Then Q; is the 
amplitude of the normal mode. The second form of the interaction VY arises from the 
Coulomb interaction between the electron and the neighboring ions. When an electron is on 
ion R; then there is a contraction or expansion of the separation between this ion and its 
neighbors. The third form of the electron—electron interaction Ve) is due to the phonon 
modulation of the hopping. The first term in (7.95) has a hopping parameter J(6) which must 
depends upon the distance to the neighbors. If this distance is modulated by ion vibrations, 
there is a corresponding electron-phonon interaction. In this case the coupling constant 
D, = dJ /dr. Both interactions V@ and V®) are present in all tight-binding solids. The first is 
appropriate for molecular solids. 

The discussion of small polarons will follow the classic treatment of Holstein (1959), 
which followed the pioneering work of Tiablikov (1952) and Yamashita and Kurosawa 
(1958). They took Vs, as the electron—phonon interaction. The same analysis works for V2). 

The small-polaron Hamiltonian exhibits two types of behavior which are very different. _ 
Each type of behavior is described in detail. Afterwards, the interesting problem is discussed 
of the transition region between these limiting cases. | 


7.2.1. Large Polarons 


The first class of behavior is “large” polaron motion, of the Frohlich type. It occurs 
whenever the bandwidth zJ is large, where z is the coordination number. The condition on the 
bandwidth is stated later with more precision. When the bandwidth is large, the Hamiltonian 
is solved in wave vector space. Transform to collective coordinates: 


Cy =y (7.101) 

H =zJ 2 Wy CLC, + 3 Ogata, + 2 ChagliM(k, q)(ay +a" 4) 
y =p? (7.102) 
Mík, q) = Xq{D, — D24 + F(a) + D3E, * [F*(k + q) — F*(k)]} (7.103) 
F(k) = 3 Sek © — —iVy, (7.104) 


where €, is the unit vector for phonon polarization. The electron-phonon interaction has the 
usual form, except that the matrix element M(k, q) depends upon k as well as on q. The 
dependence upon k comes from the phonon-modulated hopping. This term will be dropped 
(D; = 0), so that the electron-phonon matrix element M(q) = M, will depend only upon q. 
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The small-polaron Hamiltonian has a close resemblance to the Fröhlich polaron 
Hamiltonian. The only difference is the replacement of the free-particle energy €, = k*/2m 
by the tight-binding form €x = z/y,, and M, has a different dependence upon q. 

The wave vector summation for particles extends only over the Brillouin zone. In a 
realistic model of a solid, there would be many bands to be summed. For a large bandwidth, 
the particle will confine its motion to states near the bottom of the band. For J negative, the 
bottom of the band occurs at k = 0. Expand y, about the point k = 0, and find in cubic 
crystals ¢, = zJ(1 — (k8)*/6), so that the particle has an effective mass of m=! = z|J|8 /3. If 
the polaron coupling strength M, is small, this Hamiltonian can be described by weak 
coupling theory. There is a slight change in the particle’s energy and effective mass because of 
polaron effects. These changes must correspond, in the tight-binding model, to a change in 
bandwidth. The polaron self-energy, in first-order Rayleigh—Schrédinger perturbation theory, 
is 


(7.105) 


N, + 1—n-(e N. € 
q 


The wave vector integrals are rather hard to evaluate for most forms of MŽ, because the 
energy denominators have the inconvenient expressions & 49 = 2/Y,49- Numerical results 
have been obtained in one dimension (Brown et al., 1997). Frohlich-type polaron effects are 
obtained when the bandwidth is large and the polaron effects are small. 


7.2.2. Small Polarons 


Next consider the other limiting case of the small-polaron Hamiltonian (7.95). The 
polaron effects are assumed to be dominant and the bandwidth is small. The physical picture 
is that the polaron effects localize the particle on a site and that hopping occurs infrequently 
from site to site. The tight-binding term is the perturbation, while the particle-phonon term is 
large. The Hamiltonian is solved in position space for the particles without resorting to 
collective coordinates. The first step is to apply a canonical transformation which diagonalizes 
the last two terms in the Hamiltonian. These last two terms are the same as found in the 
exactly solvable models of Sec. 4.3 for the many-boson model. The canonical transformation 
has the same form: 


H = Hes (7.106) 
, M 
S=- } net R, MD a — aa) (7.107) 
ją Oq 
T t t 
H =J 2 Ci sO Xj, 5X) + 2 Wyala,—A X nj (7.108) 
May 
A= yo (7.109) 
q Wg 
iq: rR, M(q) 
X; = exp pa R, y — to] (7.110) 
q 


The polaron self-energy is A. The factors X, were encountered previously in Sec. 4.3.4. They 
arise from the canonical transformation of the particle operators e° Cje~* = C,X;. The number 
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operator n; = =C G commutes with S and 1 is unaffected by the transformation. But the tight- 
binding term von 3C; produces factor Xx}, 5X; The first term is not solvable exactly, so the 
canonical transformation did not diagonalize this Hamiltonian. However, so far no approx- 
imations have been made. The eigenstates and energy levels of H are the same as H. It is 
fruitful to investigate the solutions of H. Use the interaction representation to set 


H=H+V: 


Hy =} @,aha, — AYN, (7.111) 
m F 
— t t 
V =I 2 Cra CGA (7.112) 
J 

The exponents in the factor X, l% can be combined since they commute. Also assume that 
M*,=M 
—q q 


aR aa M 
X} să; -o| De Bq- e’ MO a, (7.113) 
q 


Combining the two factors simplifies the form of this operator for taking the expectation of 
phonon operators. The perturbation V describes the hopping of the polaron from one site R, 
to the neighboring site R, + 6. The amplitude of this process is 


t 
J( 1X1 sXli) (7.114) 


where |Z) and |f) describe the phonon occupation numbers in the initial and final states of the 
transition. Since the factor Xi 34; permits phonons to be made or destroyed, the states |i) and 
|f) may have different numbers of phonons. 

Holstein (1959) made a distinction between events in which the number of phonons were 
changed during the hop and events where they did not change. If the number of phonons is 
changed in the hop, it is an inelastic scattering process. The particle loses its phase coherence 
by this emission or absorption of phonons. Each hop becomes a statistically independent 
event. The particle motion is diffusive, since it has Brownian motion by randomly hopping 
from site to site. These events are called nondiagonal transitions. 

The other situation is a diagonal transition in which all the phonon occupation numbers 
remain the same during the hop. The number of phonons ną in each state q remains unaltered. 
(Note: ną is the integer number of phonons, while N, is the thermal average of ną.) This 
condition is rather strict. For example, if the number of phonons is unchanged but one state 
gains a phonon while another loses one, it is a nondiagonal transition. The phase coherence of 
a particle is maintained during a diagonal transition. If it is the most likely hopping event, the 
particle will hop from site to site while retaining phase coherence. Then it is a Bloch particle 
and forms energy bands. The diagonal part of the hopping probability is defined as 


er = (il X 5X)li) (7.115) 


If the nondiagonal transition probability is small, the particle will form energy bands with a 
band energy 


Ek = ZJ ype "T (7.116) 


The effect of the polarons is to réduce the bandwidth by a factor exp(—Sr). The effective 
mass is increased by the same factor. In the Holstein picture, the diagonal hops contribute to 
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the real part of the self-energy and the nondiagonal transitions contribute to the imaginary 
part of the self-energy. 

The large-polaron behavior, of the Fröhlich type, occurs when the diagonal transitions 
dominate. The small-polaron behavior, with diffusive hopping, occurs whenever the 
nondiagonal transitions dominate. 


7.2.3. Diagonal Transitions 


The diagonal and nondiagonal transition rates are obtained by using the same type of 
operator techniques introduced in Sec. 4.3. The diagonal transition rate is calculated from 
(6.2.5). The factor (f |X, ‘ OX; Xli) has two operators in the exponential which are Feynman 
disentangled, with the destruction operators to the right. Using the techniques introduced into 
Sec. 4.3 gives 


ga By — gia 8) MW) (7.117) 
q 


eSt = Ma lile li) (7.118) 


The result is given by events with the same number of phonons in each state q for both (i| and 
|i). It is assumed that the number of phonons in each state is determined by a thermal average. 
The summation is taken over each possible number ną of phonons in each state, with the 
thermal probability exp(—B@q7ą) for this occurrence [see (4.239)]: 


OO * 
e St —T] eo( A ral ‘\p — e Poa] y eo Ba” (nje a! o-n) 
n=0 
= IT, expl- làa (N, + 4] (7.119) 
So the thermal factor Sy has the form 
M 2 
( a) [1 — cos(q + 8)(2N, + 1) (7.120) 


Sr=} 


q q 


l 
N, = 


7.121 
a obo, _ |] ( ) 


The factor Sp is temperature dependent and increases with increasing temperature. For 
temperatures larger than the Debye temperature, one may often use the expansion 
Ng © kgT/@q — 5 + O(Bwg) to write 


Sp any (MO) [1 — cos(q - 8)] + O(1/T) (7.122) 
q 


so that Sy increases linearly with temperature at high temperature. The factor exp(—S,) 
determines the rate of diagonal transitions and decreases with increasing temperature. The 
Frohlich-type polaron effects, with coherent motion, are most likely to be found at lower 
temperatures. As the temperature is increased, the polaron bandwidth zJ exp(—S,) becomes 
smaller and smaller. It is reasonable to expect that the type of motion will change at some 
temperature. In this model, the change is to a hopping type motion. 
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The factor 6 is the distance the polaron hops in a single motion. It is presumably a lattice 
constant, or at least a distance associated with some fundamental unit of length on the scale of 
the crystal unit cell. The factor [1 — cos(q - 5)] suggests that phonons of short wavelength are 
important, since this factor vanishes for q — 0. The integrations in (7.120) and (7.122) 
probably have significant contributions from phonons near the edge of the Brillouin zone. 
These integrals must be done carefully and probably numerically. Simple models, such as the 
Debye model, are probably very inaccurate. Of course, the matrix elements M(q) must also be 
found accurately for zone edge phonons. The Frohlich result M(q) ~ 1/q is a valid 
approximation only at long wavelength for LO phonons. It is not adequate for the q inte- 
grations in (7.120) or (7.122). 


7.2.4. Nondiagonal Transitions 


Nondiagonal transitions are hops, from site R; to R, +6, in which the number of 
phonons is not conserved. It is an inelastic scattering process. The matrix element 
(FX, ¢ 1 AX, |i) describes the transition from one quantum state to another. By itself, it does not 
convey meaningful information. Instead, the transition rate w must be evaluated. It is defined 
as the rate per unit time at which the hopping occurs. It is calculated by first finding the 
correlation function W(t), obtained by summing over all final states except the state | f} = |), 


WO = J? Six; (OM a(S) FIX Xli) (7.123) 


= JX; OX OX), Ali) — IX} OX OD ALA] (7.124 


and then evaluating the Fourier transform: 
1 (00) 
w = — l dtW (t) (7.125) 
h? Jo 


This formula is just the Fermi golden rule (Schiff, 1955). The transition rate is directly 
proportional to the dc conductivity calculated from the Kubo formula. The latter relationship 
is o = Be25" Nyw/3. The derivation of © is provided later, after the evaluation of w. The first 
term in W(t) is a time-dependent correlation function, which is identical in mathematical form 
to that found previously for the optical absorption in the many-boson model [see (4.308)]. 
The previous result may be utilized, by direct analogy, to give the following result: 


exp[—®()] = (iX OXOX] Xli) 


| (7.126) 
DA) _ Mo 4° 6/2 HOA 4 a — e!a!) +N,(1 _ ela!) 


q q 


The second term in W(t) arises from the diagonal transition rate. Note that ix ()Xj45(2)|Z) is 
not actually dependent upon time. Repeat, the above evaluation of S; with time-dependent 
factors à. Since the result depends upon |A|*, the time cancels out. So the first impulse is to set 
this term equal to the ser of the amplitude, or J? exp(—25S,). This step is too hasty. The 
thermal average of |(i|X; ()X; Oli? is not usually equal to the square of the average. The 
diagonal transition matrix element, for a state with n phonons, is 


nX} Xn) = e" L (lug?) (7.127) 
M(q) 


Ug =], C — ga" ò) (7.128) 
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where L,,(x) is the Laguerre polynominal. The quantity exp(—S_) is obtained by averaging 

this amplitude over all possible values of n. A slightly different result is obtained if we 
average the square of the amplitude over all possible values of n: 
2 _ Ben 

EX al? = ge Ma! [1 — eP] D e Pa", u (7.129) 


n=0 


= Mylo] 21u! 2 NANa +1) | exp- 257) (7.130) 


which should be compared to the summation in (7.120). Note that |A,| = |uql. The average 
over the square contains another factor, a modified Bessel function, which is obtained for each 
state q. The final diagonal hopping probability is obtained by taking the product of this result 
over all q states. 

Now take the limit where the volume of the solid v goes to infinity. The particle-phonon 
matrix element M(q) actually has a factor of 1/,/v. Write this matrix element as 
M(q) = M(q)/,/v, where M does not have any volume dependence and is only a function of 
_q. In the limit v — œ, the summations which determine Sy and ®(t) become integrals which 
are well behaved. However, the Bessel function J, in (7.130) has an argument which vanishes 
in this limit. When z — 0, the Bessel function is 


lim pE) = 1 + O(z*) (7.131) 
lim In[Jo(z)] = 7 4 O(z*) (7.132) 


oq > (7.133) 


8 
im Emo“ w) | = ft (2n)3 


In the limit of v —> oo, the factor of J)(z) in (7.130) goes to one. The function W(t) is then 


W(t) = Jer [e9 — 1] (7.134) 
dq M(q)’ | 
287 = i a I1 — es’ 52N, +1) (7.135) 
q 
b(t) = 2S7 — (t) (7.136) 
3. AN e\2 
7 j 2 Š =~ JL = ef PIN + De! + Nye’! (7.137) 
n q 
The time integral must be done next to obtain the final transition rate in (7.125). 
J? °° 
w= er | dt(e?® — 1) (7.138) 
h —o0 


which is the Golden Rule of quantum mechanics. 

In evaluating this expression, some assumptions have to be made regarding the nature of 
the spectrum of phonons. One case is an Einstein model where all phonons have the same 
energy. This case is interesting because there are many combinations of phonons which give 
the net same change in energy. This case was discussed by Nagaev (1963) but is not treated 
here. Instead, the phonons will be assumed to possess a realistic spectrum in that most of the 
phonon states q have different energies ñ@gq. Of course, in a solid the states in the star of the 
wave vector have the same energy, so there are always several q states with the same energy. 
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But the special features from Einstein phonons do not occur from a few states, but from large 
sections of the Brillouin zone which are degenerate. The restriction to dispersive phonons is 
asssumed because it is complicated to include events in which energy can be conserved while 
phonons are being destroyed in one state, say q,, while being created in another, q,. Energy is 
conserved easily if @j = ©q, but not otherwise. 

For a general spectrum of dispersive phonons, containing both acoustical and optical 
types, the exponential function (7) has a complicated time dependence, and the time inte- 
grals should be done numerically. An approximate result may be obtained by a saddle-point 
integration, which is done below. First, it is useful to discuss the convergence and properties 
of the time integral. 

At very large values of t, then (t) approaches zero, which is fortunate since otherwise 
the integral would diverge. However, in one dimension DeWit (1968) has shown that the 
approach to unity is too slow. He showed that lim, , ,, P(A > O(1/./t), so that the integral in 
(7.138) does not converge. Since there are no one-dimensional solids, perhaps this anomaly is 
no disaster. However, from the pedagogical viewpoint, it is unfortunate since one-dimensional 
problems are nice for homework. In three dimensions the convergence is fast enough, so that 
w has a definite value. 

The physics is understood by investigating the frequency spectrum of just the first factor 
in W(t). Call its Fourier transform U (œ): 


OO (0,9) 
dee) = er | dte™ t 0 (7.139) 


—OO 


U(@) = | 

—OO 
A typical spectrum is shown in Fig. 7.5. There is a delta function at @ = 0 plus a smooth 
spectrum. The delta function is the no-phonon transition. It is the probability that the tran- 
sition takes place without any phonons being emitted or absorbed. As long as this event has a 
nonzero probability, one will see a delta function in the spectrum. The intensity of the delta 
function is just proportional to exp(—2S,), which is called the Debye-Waller factor. In the 
hopping transition, the equivalent of the no-phonon line is hopping by a diagonal transition. 
The diagonal terms should be eliminated. The subtraction of the —1 factor from the time 
integrand, in the definition of w, is just what is needed to eliminate the delta function. The 
value of w is just the smooth part of the curve in Fig. 7.5 evaluated at œ = 0. 

An approximate result for w may be obtained by a saddle-point integration (Schotte, 
1966). This approximate result is very accurate in the limit where the particle-phonon 
coupling is large, so the polaron effects are significant. 

First we rewrite o(t) in (7.137) as 


(e) =2 2 [igl /Na (Ng + cosl o (: + *) (7.140) 


FIGURE 7.5 The function U(o). 
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The ¢ integration is treated as a contour integral in a complex space. The path of integration is 
deformed to go over the saddle point. The saddle point t, is located at the point t, = —iB/2. 
The deformed contour is shown in Fig. 7.6, with the saddle point at @. The function 
P(t) = (z) is expanded as a power series about the saddle point, where the distance from the 
saddle point is z = t + iB /2 


(z) = 2 2 Iual? / N (Na + 1) cos(@gz) (7.141) 


~ (0) — YZ + O(z*) (7.142) 


(0) = 2 3 lugl?/Ng(Nq + 1) (7.143) 
=X jua Ogy NNa + 1) (7.144) 


An approximate result is obtained by neglecting all terms except the first two. The time 
integration, in the vicinity of the saddle point, is now just a Gaussian integral: 


| Jed x 290) l de? = 0 J: 
y 


w= 7 expl-25r + 6(0)] (7.145) 
The saddle-point integral does the exp[—(¢)] part of the integrand and not the —1 term. 
However, the latter contribution just eliminates the delta function corresponding to the no- 
phonon transitions. The saddle-point integration does not have this contribution. It provides 
the smooth background part of U(@) at œ = 0, which is precisely the contribution which is 
needed for w. 

The two exponential terms in (7.145) can be combined, 


257 — §(0) = Dal? [ Na+ 1 - My | (7.146) 


which is a positive definite form. The exponent in (7.145) is always negative. At high 
temperatures, the phonon occupation numbers may be expanded: N, —> (kgT/@q) — 
++ O(@,/kgT). Define x = Boy, and the factors in the exponent give 


2 
tim| (Ny +1- M| =20 +1 -2N 0, +1) 


2 1 1\'7 x 
| - 1 
287 — (0) > a> lglg (7.148) 
B q 
© 
\ ¿B 
2 


FIGURE 7.6 Path of integration in complex ż plane. 
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The high-temperature expansion defines an activation energy A for the hopping rate 


dq Mla’ 


1 — e452 7.149 
ny a, | | ( ) 


A — 1 2 _ 1 
A =32 lüql o=}| 
q 


The same factor A defines y = 4AkpT at high temperature. 

Small-polaron theory predicts that the hopping rate is thermally activated. The rate of 
hopping increases at higher temperatures. This behavior is in direct contrast to the rate 
J? exp(—2S7) of diagonal transitions, which decreases with increasing temperatures. As 
temperature increases, the diagonal transitions become less likely, while the nondiagonal 
transitions become more likely. The band-type polaron motion, or large polarons of the 
Frohlich type, will exist only at small temperatures. Holstein estimated the transition 
temperature between band motion and hopping motion to occur around 40% of the Debye 
temperature. The estimate is remarkably insensitive to the magnitude of J. In the Holstein 
model, the low-temperature motion should be band-like, while the high-temperature motion is 
hopping. There have been many experimental systems with these characteristics which have 
been ascribed to small-polaron theory. One example is TiO, (Bogomolov et al., 1968). They 
observe the transition from band to hopping conductivity at about 300°C. The conduction 
bands of transition metal oxides are often d bands, and conduction in them seems well 
described by small-polaron theory. See the reviews by Adler (1967), Appel (1967), and 
Böttger and Bryksin (1976). 

There are many other systems in which the electron conductivity is thermally activated. 
They may often be otherwise explained, e.g., by an activation energy for freeing bound 
electrons from defects or alternately by a static barrier to the electron motion. However, it is 
thought that in most systems in which the electron motion is by hopping there is some 
polarization of the electron by its immediate surroundings. The hopping electron must carry 
its local polarization along, which gives the activation energy. The small-polaron picture 
applies when this polarization is due to phonons or atomic realignment near the electron. Of 
course, there could also be polarization of the electronic states, for example, the dielectric 
screening by the material of the electron charge. Electronic polarization can be described by 
an electronic polaron model, which has the same mathematical form as small-polaron theory. 
Here the bosons are not phonons but density operators representing electron-hole pair 
excitations of the system. This concept applies even in insulators, where the pair oscillations 
have an energy gap. 

The important message in small-polaron theory is that the motion is thermally activated. 
If the particle polarizes its surroundings, then it can hop only by moving this polarization 
along. The greater the polarization, the less likely hopping occurs. Each jump may occur only 
when the polarization arrangements on initial and final sites are the same. Since the system is 
in a state of continual fluctuation, this coincidence sometimes does happen. It is less likely 
when the polarization is most severe. That is the reason for the thermal activation. 


7.2.5. Kubo Formula 


The Kubo formula for small polarons was evaluated by Lang and Firsov (1963, 1964). 
The conductivity o(@) is evaluated from the current—current correlation function. The limit of 
œ — 0 provides the dc conductivity, while the optical absorption is given by the result for 
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nonzero frequencies. Both results are interesting for small polarons. The discussion will begin 
with the form of the Kubo formula given in Problem 16 of Chapter 3: 


l—e bore 
Re(o) =| dte™ (ILAI ,(0)) (7.150) 
= GUS C! CX} 5X) (7.151) 
J 


The electrical current operator is the hopping form (1.204), which has been subsequently 
altered by the canonical transformation (7.108). The spatial subscripts « are replaced by the 
diagonal sum in cubic crystals: 


Re[o(o)] “oF T 2 3°35 | dte’ 


(Ch OCOC G(X OOX yX)) (7.152) 


In the electron correlation function, the operators have a time development governed by 
e!4tCe-iHt where H also contains the hopping term V in (7.112). In the interaction repre- 
sentation, the expansion of the S-matrix will result in terms of higher power in J. These terms 
are seldom considered, and it is customary to evaluate only the term of order J*, which is 
certainly the leading term. However, the neglect of higher terms in J is mostly expediency, 
since they are hard to calculate. There is generally no proof available that these higher-order 
terms are smaller. 

In the interaction representation, the zeroth-order term in the S-matrix expansion just 
replaces the electron part of the correlation function by its value obtained using the time 
development governed by H, rather than H. This leading term in the conductivity will be 
called o(@). In this approximation, the electron correlation function is easy to evaluate 
since there is no time dependence: 


(Cl (OC; (Cİ 48! Cir) = ÖÖ (ny (1 — Ny+s')) (7.153) 


The correlation function is just equal to c(1 — c), which is the probability c that the initial site 
is occupied times the probability (1 — c) that the final site is empty. It is the correct average 
for this correlation function whenever there is no correlation between the site occupations of 
neighboring particles. Such correlations exist, for example, whenever the forces between 
particles on different sites are included. Most model calculations omit such forces. 

The electron correlation function is nonzero only when j = 7’ + 5 and 8 = —8. The 
calculation of the real part of the conductivity o() is now reduced to the evaluation of the 
correlation function for phonon coordinates, which is the nondiagonal transition rate for each 
site, summed over sites 


ewe ; 
see a-o dte™ (XOX OX X) (1154) 


—OO 


Re[o(a)] 


The solid is assumed to have sufficient symmetry that the rate for each pair of sites is the 
same. The summation over sites j just yields the number of sites, which is called M. The 
summation over 6 yields the coordination number z. The number of particles is N, = Ne, 
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where c is the concentration. For dimensional reasons, divide by the volume of the solid v, so 
that the real part of the conductivity is 


Oey 12 (NAPE 
Re[o(o)] = (=) c(1 — c)U(o) (7.155) 
U(@) = i dte™ OX, SOX, 6%; (7.156) 


The conductivity is proportional to the function U (œ), which was mentioned above. It has a 
delta function at zero frequency, which should also be eliminated from Re[o()]. The real 
part of the conductivity o©(«) is just proportional to the Fourier transform of the hopping 
correlation function W(t). The dc result œ — 0 was evaluated earlier in (7.95) by a saddle- 
point integration, which for the de conductivity gives 


2 
of) = = c(1 —c) (=) Bw (7.157) 
6v 
J? A A 
w=-+[(-] exp| -———; (7.158) 
h* \y kgT 


The same saddle-point integration also provides an estimate of the frequency-dependent 
conductivity (Reik, 1972). Again the integration variable is changed to z = t + iB/2, and (z) 
is expanded about the point z = 0: 


CO 
U(o) = eas | dte™ e% (7.159) 
— e251 t+ Bo/2 | dz toO (7.160) 
—CO 
x e251 +Bo/2+(0) D dze- Y7 (7.161) 
—C 
rt 1/2 7 2 
x (=) exp] —2S, + Bo/2 + (0) — — (7.162) 
y 4y 
sinh(Ba/2) o? 
Re[o(w)] = oY ———— L ex -5 (7.163) 
d (Bo/2) PL 4y 


At nonzero frequency the integral over z is still of the Gaussian form and yields a Gaussian 
function of frequency. The result for Re[o™(m)] is important for relating theory and 
experiment. It provides a prediction of proportionality between the de conductivity and the 
optical absorption. For example, it predicts that the optical absorption is Gaussian, with a 
width given by 4y = 16Ak,7, where A is the activation energy observed in the de conduc- 
tivity. This prediction may be tested experimentally. It also predicts that the magnitude of the 
absorption is proportional to the magnitude of the dc conductivity at each temperature. These 
predictions are confirmed in TiO,. Figure 7.7 shows the optical data of Kudinov et al. (1970). 
The absorption data is in satisfactory agreement with the theoretical curve, which is calcu- 
lated using parameters which also fit the dc conductivity. The physics of the absorption 
process may be understood using a configurational coordinate picture suggested by Polder 
(unpublished; see Reik, 1972). The coordinates are illustrated in Fig. 7.8. A particle sitting at 
site j has a parabolic potential energy curve which represents the potential energy of the 


466 Chap. 7 e Electron-Phonon Interaction 


O 2 4 6 8 IO 12 14 I6 3 -i 
v, IO~ cm 
L i i i a d a a mammam 
0.5 1.0 1.5 20 
E,eV 


FIGURE 7.7 Optical absorption by small polarons in TiO, for E||c at T = 600 K given by points. Dashed curve is 
theory with A@, = 800 cm~! = 0.1 eV. Source: Kudinov et al. (1970) (used with permission). 


phonons for a displacement Q about the equilibrium point. The neighboring site j + 6 has an 
identical parabola, displaced to the new equilibrium point, at a distance Q). The dc 
conductivity occurs by thermal activation over the intervening potential barrier A, which is 
midway between the two parabolas. If the parabolas have a curvature given by æ, so that the 
potential energy curves are V(Q) = «Q*, then the activation energy is A = V(Q)/2) = 
«0% /4. Similarly, the most probable optical transition is the vertical arrow in Fig. 7.8. It puts 
the particle on the neighboring parabola, and later the particle relaxes down to the minimum 
at the neighboring point. The length of this arrow is the energy /@,, = V(Qo) = «Q3, which 
is four times the activation energy. The proportionality 4@,, = 4A agrees with the result 
(7.163). The configurational coordinate model may be used, in fact, to derive (7.163) directly. 
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FIGURE 7.8 Configurational coordinate drawing of small-polaron motion. The probable optical absorption is the 
transition @,,, while the hopping barrier is A. 
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7.3. HEAVILY DOPED SEMICONDUCTORS 


An interesting application of polaron theory is in many-particle systems. If the solid has 
many electrons, one has to consider electron—electron interactions along with the electron- 
phonon interactions. They are very important in metals, which are discussed in the next 
section. Other systems, which mix the two theories of polarons and electron—electron inter- 
actions, are semiconductors which are heavily doped with impurities. Many II-V semi- 
conductors are polar and may also be doped sufficiently with impurities to form a degenerate 
electron gas. Then at low temperatures the electron gas has a well-defined Fermi energy, and 
the electrical resistivity is finite. 

These systems are interesting from several viewpoints. As an electron gas, they may 
have an effective density which is very high. The actual density is quite low, perhaps only 
ny ~ 10!8-10!? cm-3. However, the value of r, is calculated with the effective Bohr radius, 


h? & 
as = (7.164) 
43\ —1/3 
r= (ma) (7.165) 


which has the effective mass m* and static dielectric function £ọ for the semiconductor. 
Because typical values are m*/m ~ 0.1-0.2 and £ọ ~ 10, the effective Bohr radius may be as 
large as 5nm. Even for an electron concentration of nọ ~ 10!8 cm’, a Bohr radius of 
aš ~ 5 nm gives r, ~ 1, which is a smaller value than found in metals. Even smaller values 
of r, are obtainable in many semiconductors. Semiconductors are a suitable environment in 
which to study the high-density electron gas. 

Many semiconductors are polar. The interaction between electrons in the conduction 
band and LO phonons is well described by the Frohlich Hamiltonian. Many of these semi- 
conductors may also be doped to high electron concentration, so that experimentally one may 
study polaron theory in a many-particle system. However, nature does not seem to like these 
two conditions—strong polaron interactions and high doping levels—to occur in the same 
materials. Materials which are very polar, such as alkali halides or II-VI semiconductors, can 
seldom be doped to where the electron gas is degenerate. Group IV (Si, Ge) and III-V 
semiconductors may be doped readily to high levels but are either weakly polar or not polar. 
The theory of polarons, in many-particle semiconductors, needs only to treat systems with 
weak coupling between electrons and phonons. An advantage of these experimental systems 
is that all the parameters of the Hamiltonian are known from other measurements. The only 
parameters of the theory are m*, no, €&ọ, and €, which all can be obtained by simple 
measurements. Many-body theory may be tested in situations where there are no adjustable 
parameters. 

The self-energy of an electron from electron—phonon interactions is dramatically altered 
by the presence of other electrons. Several effects are described. First, the phonon energies 
themselves are altered by the screening properties of the electron gas. Second, the electron— 
phonon interaction is screened by the electron gas. The screening is a significant feature, so 
that even intermediate values of « are reduced to weak coupling. Third, the nature of the 
electron’s self-energy is changed dramatically by the Pauli exclusion principle. When elec- 
trons interact with phonons, they can only scatter into states not occupied by other electrons. 
The sum of these three effects is that polaron theory is entirely changed from the simple one- 
particle theory of the previous sections. 
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7.3.1. Screened Interaction 


A little thought shows the system is complicated. The phonons and the electron gas are 
mutually interacting and affecting each other’s properties. Should the first step, in the 
calculation, be to calculate the effect of the phonons on the electrons, or vice versa? The 
answer is neither. Instead, one should calculate the total longitudinal dielectric function of the 
system, which includes parts from both electrons and phonons. This function will be suffi- 
cient to describe all the relevant physics. 

The effective interaction between electrons may be written as 


Ane Vg 


= 4 _ (7.166) 
q? Exotal (4; o) Etotal (4 ’ o) 


Voff (q, o) = 


where £ta(g, ©) is the total dielectric function from all sources. The total dielectric function 
will be derived in three ways. The first will be a phenomenological derivation. The other two 
derivations use Green’s functions. 

The simple derivation is given first. The dielectric function is the summation of the three 
contributions: €, the electron—electron interactions, and the electron-phonon interactions. 
The first two are known, so that 


Etotal(, ©) = Ego — ¥gP(g, @) + phonon term (7.167) 


where P(q, @) is the polarization of the electron gas, which was discussed in Sec. 5.5. The 
phonon term will be calculated assuming that there are no electrons present. 

For optical phonons in polar crystals, the ions will vibrate in response to an oscillating 
electric field. If X is the distance between a pair of plus and minus ions with reduced mass M, 
the classical equations of motion are 


MX + KX = eEye™ = eE(1) (7.168) 
The force constant is K = Mw. The oscillatory solution has a periodic displacement of 


eE(t) 1 
M 7 — o? 


X(t) = (7.169) 


The net polarization of the system is P = en,X, where e and ny are the charge and density of 
the ion pairs. The contribution to the dielectric function is 


X(t) | Ane*ng l 


—— = — 7.170 
E(t) M Oo- 0? ( ) 


As = 4ra = 4reno 


The collection of constants 4re?ng/M is an ion plasma frequency. It may be determined 
phenomenologically by realizing that in the absence of the electron—electron term, the long- 
wavelength dielectric function of a polar crystal may be written as 


Ey — E 
&(@) = £n tI at (7.171) 
Ane*no 
So — Foo = Mažo 


(7.172) 
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At zero frequency e(0) = £ọ while at frequencies above the reststrahl œ >> @7o then 
&(@) = £x. These limits define the two quantities &) and £. The total dielectric function is 
€ — 

Etotal (4 , @) = £% + ° = 


€ 
o to L v P(g, 1 
ETI (q, @) (7.173) 


In the derivation, the three contributions to the dielectric function are treated as independent 
and additive. This assumption is not quite right. The rigorous derivation shows that the 
phonons affect the electron—electron contribution. When calculating the polarization diagrams 
for P(q, œ), there are diagrams in which the basic electron bubble has internal phonon lines. 
However, these terms are never included in actual calculations. Instead, the term v,P(q, @) is 
approximated by either the RPA or the Thomas—Fermi model. In this approximation, the three 
terms are independent. 

The rigorous derivation of the total screening function proceeds by summing diagrams. 
The Hamiltonian has the form 


H = Hy + Hee + Hey + Hyp (7.174) 
Hy = Yo EpC Coo + © Og. ai dgn (7.175) 
po qr 
l 
Hee = =~ 2 vr PPa) (7.176) 
Va 
l t 
Hy = 3 Z VDA DAA) (7.178) 
q 
v 
P=, A =aytarg, PA =LChrqoCpo (7.179) 
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The electron density operator is p(q). The electron-phonon matrix element is M,,. The 
phonon-phonon matrix element is V, (q). The electron—electron interaction has been written 
as the product of two density operators. This form has the liability that it permits an electron 
to interact with itself. These terms are meant to be absent and are ignored. The shorthand 
notation is convenient for the following discussion. 

The term Ho describes the noninteracting systems of electrons and phonons. The elec- 
tron’s energy &, = €p — H is measured from its chemical potential. The kinetic energy is 
usually assumed to have the form £, = p’/2m*. Band structure effects are included by an 
effective mass m*. In semiconductors m* is usually smaller than a bare electron mass. The 
electron-phonon part of the Hamiltonian has the usual form. Our applications will mostly be 
to polar coupling, where M, has the form in (7.2). The electron—electron interaction has an 
interaction vy”) = v,/Ego. 

The factor €,, is included to account for the dielectric screening of the material. There 
are three sources of dielectric screening in the system. The first is the electron—electron 
interactions from the mobile electrons in the conduction band; their contribution to the 
screening is included explicitly by the term He. The second is from the optical phonons, and 
they are included through the term H,,. The third is from the high-energy electronic exci- 
tations across the band gap of the semiconductor. They give rise to €,,. They could also be 
included as an additional term in the Hamiltonian, which would be solved to give £.. 
However, for the low-frequency excitations of interest in the present discussion, these high- 
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energy excitations are uninteresting because they give a constant contribution to ¢(q, œ). It is 
simpler to include these effects through the constant €, which is introduced in the Hamil- 
tonian at the beginning. 

There are several different ways to solve these equations which give the same answer. 
The easiest method is done first. In order to keep the discussion simple, the treatment is 
specialized to longitudinal optical (LO) phonons. The ions are treated as point masses with an 
effective charge eZ. The various interaction terms are approximated as 


Oy =r My = v7 °ZqX(q) (7.180) 
h 

V: (Q = v2 @exX(q"r, Xq = — 7.181 

D= PPXA, XO = o (7.181) 


The phonon frequency @,) is chosen to be the TO frequency, rather than the LO frequency. 
The difference between these frequencies arises from the long-range Coulomb interactions in 
H,,,. As long as the phonon-phonon interactions are explicitly included in the interaction term 
of the Hamiltonian, then the TO frequency must be used in Hp. Later it is shown how the term 
H, changes Oro to M0. 

All of the interactions are due to long-range Coulomb interactions: electron—electron, 
electron-phonon, and phonon-phonon. All of the interaction terms can be combined into one 
term 


Hint = Hee + Hep + Hyp (7.182) 
=} E uo) ph(q)pr(q) (7.183) 

q 
pr(q) = PP + ZXM) (7.184) 


The total density operator pr(q) is the summation of the density operators for electron and 
phonons. 

The dielectric function is calculated for this interaction by following the same steps used 
in Sec. 5.5.2 to obtain RPA. The difference between that case and the present one is the 
phonons. By following the same steps, one can show that 


£(q, 10) = Ell — oP? (a, io) (7.185) 
a 
PP i0) =~ | dre Tprla DPA, 0) (7.186) 


The electron and phonon operators act independently. The most important terms in P® are 
the summation of the separate susceptibilities for electrons and phonons: 


P(g, io) = PP (q, ia) + P% (a, io) (7.187) 
Pip (a i0) = ZPX (a D(a, io) (7.188) 
2 2 yd 
Dr ay Oni 2 4ne z no 
—UgP on (q, i@) = Oy — œ? ’ Opi = TH (7.189) 


The electron polarization PS is identical to the RPA result in Sec. 5.5.2. This dielectric 
function is identical to the one found in (7.173). It is the RPA result for a system of electrons 
and LO phonons. This derivation is rather simple because all long-range interactions were 
included in H; 


int: 
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The third derivation of this same result is the one presented in most reviews. The first 
step is derive an effective interaction W(q) between electrons. It has two terms. The first is the 
direct Coulomb interaction yr The second term is the interaction between two electrons 
which results by the exchange of a phonon. 


W(q) = vo + V,,(q) (7.190) 
V (q) = M; Dl) (7.191) 


The phonon Green’s function 2(q) should include the phonon self-energy due to phonon- 
phonon interaction H,,,. This self-energy is V, = o? /20ro. The phonon Green’s function is 


—20ro _ —20ro 


D(q, 0) = 2 
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Zo = O79 + O7; (7.193) 


The RPA expansion for the dielectric function is now evaluated with the effective interaction 
W (q) which gives that 


W(q) Vg 
fet) =T WOPO aA 
The last equal sign states that this expression is identical to the previous one in terms of the 
total dielectric function. Proving this identity is assigned as a homework problem. 

The total dielectric function is often well approximated by the sum of the independent 
contributions. For example, the piezoelectric contribution from acoustical phonons provides 
another term in the above series (Mahan, 1972) for crystals which are piezoelectric. The 
starting point of a many-body calculation for any system with several contributions should be 
to find the dielectric function. Often a satisfactory approximation is obtained by just adding 
the separate contributions. It will be shown later that this does provide a description of the 
mixing and interference between these modes. The mixing occurs because all these various 
modes—LO phonons, plasmons, piezoelectric phonons—have longitudinal electric fields. 
The longitudinal electric fields of the various modes cause the mutual coupling. This coupling 
is well described by the dielectric function. 

A curious feature of the optical phonon part of the dielectric function (€ — €,,)/ 
(1 — w*/@%p) is that the resonance frequency is @rọ. The resonance is the frequency where 
the energy denominator vanishes. This position of resonance is in contrast to the Green’s 
function for optical phonons, which has its pole or resonance frequency at the LO phonon 
frequency @, 9. There is no disagreement between these two results. The longitudinal exci- 
tations of the system are not at the poles of e(œ) but are at the poles of 1/e(@). For example, if 
the polar solid had no conduction electrons and was an insulator, the total dielectric function 
would just be the first two terms of (7.173) or 


(7.194) 
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The poles of 1/e(@) are indeed at w, 9. Of course, in a polar solid there are both LO and TO 
phonons. However, only the LO phonons make long-range electric fields, so only they 
respond to a longitudinal electric field. The last relationship between the phonon frequencies 
and dielectric constants is due to Lyddane et al. (1941). 

The combined system of electrons and phonons has the dielectric function &,o4,(¢, ©) 
given in (7.173). The excitations of this system must be given by the poles in 1/€,o4,)(¢, ©). 
Let Z,(q) denote the complex values of œ at which these poles are found. According to the 
theorem of residues, the complex function may be expressed exactly as the summation over 
these poles and their residues R;: 


_ | — | _—R@ (7.199) 
Erota (g; Z) T Z —Z(qy 

The poles always occur in the pairs +Z; since €,,,; is an even function of œ. Each pole is then 
interpreted as an excitation of the system, with a coupling strength given by the residue. The 
expansion in poles and residues would be an exact way to describe the excitations, and their 
couplings, if 1/€,,.4, were only a sum of poles. However, previous experience indicates that 
there are branch cuts in the complex function. In the electron gas, these described the pair 
excitations, and the same excitations should occur here. The exact description of the exci- 
tation spectrum is all contributions where there is a nonzero contribution from 


S(q, ©) = im] | (7.200) 


Ng Etotal (4; œ + id) 


which would include poles as well as branch cuts. 
There are two important frequencies in the coupled system: the LO phonon frequency 
©zo and the plasma frequency of the electron gas: 


2 
oe = eM (7.201) 
Eq m* 


The coupled modes have a quite different character depending on whether @,9 > @, or 
©, > Oro. The case ®, >> @zo is discussed here. In semiconductors, the experimentalist may 
vary No, and hence œ,, so the two situations @, < @,9 Or ©, > Mo may be achieved in the 
same material. 

The total dielectric function is denoted as €,,,,;. Perhaps this should just be called e(q, œ). 
However, it is conventional to reserve the latter name for only the electron—electron parts of 
the dielectric function 


e(q, ©) = 1 — vo P(g, ©) (7.202) 
q 


In the case where œ, > Ozo, it is customary—although perhaps not necessary—to explicitly 
separate the electron—electron part of the effective interaction: 


v yi) 
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The second term on the right is the screened electron-phonon interaction. It is defined by this 
equation, so V,,__,), is the difference between Vopr and the screened Coulomb interaction. This 
definition may be manipulated by combining these two terms: 


(co) (co) 
Veg oy =—t Ph io (7.204) 
SP 1 WP? HVP 1-0 P 
4 
-— (7.205) 
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From the definition of V,, = M?D), rewrite the effective interaction as 
(co) 
Voce = vi + M2D (7.207) 
eff — g q . 
- M 
M, = — (7.208) 
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1—M?DP/e œ — œ} — 20,M?P(q)/€ 


The screened electron-phonon interaction V,,_,, is expressed as the product of the renor- 
malized phonon Green’s function D(q, œ) times the square of the screened matrix element M,. 
The phonon Green’s function contains self-energy terms arising from the polarization of the 
electron gas. The quantity P(q, œ) describes the polarization of the electron gas, while M? is 
the coupling between the electrons and phonons. It is important to note that this phonon 
Green’s function is different from the one which would be derived from Dyson’s equation by 
ignoring the electron—electron interactions. The latter result would be Dyson’s equation with 
no electron—electron interactions: 


D® 


/ — 
D = TE M2D Pp 


(7.210) 


The difference is the additional factor of 1/e in the self-energy term. The derivation of 
D(q, œ) includes electron—electron interactions properly and show that the factor of 1/e 
actually belongs. Since £ is usually greater than unity, including the 1/e factor will consid- 
erably weaken the self-energy effects. 

Another interesting feature of the screened electron—phonon interaction is that the matrix 
element M, is divided by the dielectric function £. Since M, is something in the nature of the 
electron—ion potential, it is reasonable to expect it to be divided by the dielectric function. The 
conduction electrons screen the electron—ion interaction. The ions are heavy and move with 
small frequencies compared to electronic response times. The electrons are able to follow the 
ion motion, which provides the screening. 

There is a good reason for separating the effective interaction into the Coulomb and 
screened phonon parts. When œ, >> Ozo, the dielectric function e(q, œ) which occurs in the 
phonon part needs to be evaluated at œ © @ 9. For œ, > Ozo, (4, @z0) is well approximated 
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by the static limit of e(q). Then it is a good approximation to rewrite the effective interaction 
as 


(00) 
(b) - 

V e = —-—_ + M? D(a, 7.211 
eff sa, o) * a D(q, ©) ( ) 
- M 

q 
= — (7.212) 
1  &(q) 
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In evaluating the screened Coulomb term, the full frequency dependence must be retained in 
e(q, œ). But the static limits e(q) and P(g, œ = 0) are satisfactory in the phonon term. Most 
calculations have used either Thomas-Fermi theory or the RPA for the dielectric function. 
Although it would be more accurate to use a Singwi—Sjélander dielectric function, the 
additional labor may not be worth the incremental increase in accuracy. 


7.3.2. Experimental Verifications 


First-order Raman scattering is permitted in crystals which lack an inversion center. Two 
examples are zincblende and wurtzite, which are the crystal structure of most HI-V and II-VI 
semiconductors. The Raman experiment measures the frequency of excitations which have 
the same magnitude of wave vector as the incident light. For optical frequencies, the wave 
vector is typically 10° cm~!, which is essentially the limit of q > 0 for P(q, œ). In this limit, 
the electron-gas term —v,P(q, @) becomes -o7 /@*, so the total dielectric function is 


E0 — Eg 


22 


lim Etotal (4, W) = £% + 
q—0 


The poles of 1/€,,44(@) occur where £wta(@) = 0. Setting the above equation to zero gives a 
quadratic equation for w*, which is easily solved to give 


o4 — llo? + O70 + y (02 + 02o) — 402070} (7.215) 


These two solutions are plotted as the solid lines in Fig. 7.9 as o? is changed by increasing no. 
The parameters are appropriate for GaAs, which has m* = 0.072, £% = 11.3, @7o = 268 
cm7!, and 9 = 291 cm7!. Also shown is the experimental data of Mooradian and Wright 
(1966) for GaAs; they measured the frequencies by Raman scattering. There is excellent 
agreement between theory and experiment. 

The LO phonons and the plasmons are two modes which are mutually coupled. Figure 
7.9 represents a typical crossing phenomenon of two coupled modes. It is a particularly clear 
example, since both modes are completely classical. Even if Å is restored to our equations, it 
would not appear in (7.215). It is also useful to understand the asymptotic limits of very high 
and low density. At high values of nọ, the analytical solution of (7.215) gives the roots when 
OF > Ofo 


wo, > o? (7.216) 


0 > WF (7.217) 
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FIGURE 7.9 Plasma frequency in GaAs as a function of electron density nọ, as determined by Raman scattering. 
Solid line is theory using ®79 = 268 cm™!. 9 = 291 cm™!, £ = 11.3, and m* = 0.07. Source: Mooradian and 
Wright (1966) (used with permission). 


The phonon-like mode has a frequency 79, not ®;9. The choice @ro occurs because the 
frequency difference between 79 and ©zọ is caused by long-ranged Coulomb interactions. 
The electron gas screens these long-ranged Coulomb effects whenever œ? >> fo. In this 
case, the electron gas can respond with a higher frequency than the phonons and can follow 
the motion of the phonons. The electron gas will screen the phonons and prevent long-range 
electric fields. The frequency difference between ©zo and @79 vanishes at long wavelengths, 
and the longitudinal phonon excitation has frequency @7o. Similarly, the phonons cannot 
follow the plasma oscillations of the electron gas, so the phonons do not contribute to the 
screening of the electron—electron interactions. The plasma frequency in (7.201) has a 
dielectric constant €,,, not €). The terms which contribute to s,, are high-frequency interband 
transitions, which can certainly follow these low-frequency plasma oscillations. 

The other limit of density has low values of nọ. When œ < @jo, the two roots of 
(7.215) are 


2 2 
w4 > Ojo (7.218) 
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The Lyddane—Sachs—Teller relationship has been used in the last identity. Now the phonon- 
like mode has a frequency @,,. The electron gas cannot oscillate as fast as the phonons and 
does not screen the long-ranged Coulomb fields of the phonons. The phonons have a long- 
itudinal frequency given by @; 9, which must be the correct limit as ny —> 0. Since the 
phonons can now follow the oscillations of the electron gas, they do contribute to the 
screening of the plasma oscillations. Hence the plasma frequency now contains the screening 
factor € rather than £.. 


7.3.3. Electron Self-Energies 


The motions of the electrons are affected by the other electrons and by the phonons. 
These interactions determine the self-energy of the electron. The self-energy will have 
contributions from electron—electron as well as electron-phonon interactions. They combine 
into the screened interaction, described in Sec. 7.3.1 and summarized in (7.166) and (7.173). 
It is a rather cumbersome expression in its general form. Actual evaluation, to get analytical 
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results, will require making some approximations at some point in the calculation. The only 
calculations which have been reported, and verified experimentally, are those for the limit that 
©, >> zo. In this case it is permissible to separate the self-energy effects into an electron part 
and a screened phonon part. The electron—electron part may be evaluated by the procedures 
for the homogeneous electron gas given in Chapter 5. Nothing further needs to be mentioned 
about this term. 

The present objective is to calculate the electron self-energy from electron-phonon 
interactions. The first step is to calculate the one-phonon self-energy term. For the screened 
electron—phonon interaction in (7.209), it is 


Bq M; 


(on) eq ia) Bq, io, )G(p + q, ip + io,) (7.220) 


There should also be a summation over the different phonon modes A, which is omitted, since 
the discussion will be confined to polar coupling. Later it will be decided whether this one- 
phonon term is sufficient. Perhaps more terms will be needed, with higher numbers of 
phonons. It will turn out that the one-phonon term is adequate, since the coupling is weak. 
The screening weakens the effective coupling constant. Materials which have intermediate 
values of « when they are insulators become weak coupling when the electron gas screens. 
The screening will be shown later in great detail. For the moment, the one-phonon self-energy 
is sufficient. Also defer, for the moment, the question of whether to use the Rayleigh— 
Schrödinger or Brillouin—Wigner form of the self-energy. The energy is designated as ip, 
which will be §, for the Rayleigh—Schrédinger form of perturbation theory and œ for the 
other. 

The one-phonon self-energy (7.220) will be evaluated for the degenerate semiconductor. 
A degenerate semiconductor has been doped with a sufficient number of carriers that they 
become an electron gas, with a well-defined Fermi surface, in the limit of zero temperature. 
The degenerate limit will simplify the calculation in a number of ways, since the homo- 
geneous electron-gas results are used to describe the dielectric function e(q, iœ). 

The self-energy in (7.220) has a free-particle electron Green’s function G but a 
renormalized phonon propagator 2. The procedure is to solve the phonon system assuming 
that the electrons are a free-particle gas. The self-energy terms calculated for the phonons 
have already been discussed. The phonons change their frequency at long wavelength, 
because of the screening of the electron gas. These newly found modes could be used in 
calculating the properties of the electrons; the phonon Green’s function in (7.213) could be 
used in (7.220). Although this procedure is reasonable and systematic, it makes the analytical 
calculation too difficult. Instead, an unperturbed phonon Green’s function Q is used in 
(7.220). An Einstein model is assumed for the optical phonon frequencies: 


—2Wo 


Q@(g, iw,) = 9 (io,) = 
(a io,) = IG@,) = Ss 


(7.221) 


The basic problem with (7.220) is that there is too much physics in it. It describes how the 
electron gas screens and modifies the phonons and their interactions with electrons. It does 
this sufficiently well that it is too cumbersome to evaluate without making some approx- 
imations. One may as well make these approximations at the beginning of the calculation, 
since doing it early saves a lot of work later on. The first approximation has already been 
described which is to replace the phonon Green’s function 2 by the unperturbed propagator 
Gr’, 
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The next approximation is much more drastic. The dielectric function e(q, im) will be 
approximated by its static value e(q). The frequency dependence of the dielectric function is 
ignored. This approximation is reasonable as long as the phonon frequencies are much less 
than the plasma frequency ,. The approximation e(q, œ) © e(q) is allowable whenever 
Ozo X ©. In fact, the dielectric function may now be combined with the matrix element to 
give a screened interaction V,(q) = M; 2 1g(q)°. The self-energy function in (7.220) now has 
the form 


Bia, Gp + q, ip + io) (7.222) 


5(p, ip) = -5E (S40, : 


The summation over Matsubara frequencies now has the simple form which was evaluated in 
Sec. 3.5: 


(7.223) 
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A much more complicated result is obtained when the more accurate forms of e(q, iœ) and 
Q(q, iœ) are retained in doing the summations. The simple result (7.223) is the basis for our 
discussion. 

The previous calculations of polaron theory always occurred in one-electron systems in 
which the Fermi occupation factors np could be set equal to zero. In the present discussion, 
they must be retained, and they play an important role in the nature of the final result. Only the 
case for the zero temperature is discussed, where the phonon thermal occupation factors 
Ny = 1/(e?®% — 1) can be set equal to zero. Next consider the wave vector integrals, where the 
integration variables are changed: k = p+ q 


1 — np(&) nr(&k) | (7.224) 
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A model form must be chosen for the dielectric function e(q). Of course, it would be super- 
accurate to use the Singwi—Sjélander form, but it is sufficiently complicated that a computer 
is needed to find any results. The Thomas—Fermi form is used instead. Although less accurate, 
it permits an analytical answer. Later it is argued that the answer is insensitive to the 
screening, so that the Thomas—Fermi approximation is probably adequate. However, this 
assertion has never been thoroughly checked by doing the self-energy calculation with the 
several alternate dielectric models of Chapter 5. 

The retarded self-energy has both real and imaginary parts. The imaginary part will be 
done first. It can be evaluated exactly now that enough approximations have been made. It is 
given by (ip > œ + ið) 


Im[X(p, o)] = — 
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+ npl&)3(O — E + &)] fao V.(p — k) (7.225) 


The delta functions take out the dk integrals. There remains just the angular integrals. Let 
v = cos denote the angle between p and k. For polar coupling, with Thomas—Fermi 
screening, the angular integrals are 
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The integrals are straightforward and give a lengthy result: 


Im[2(p, œ)] = — 
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The imaginary self-energy is shown in Fig. 7.10. It is plotted as a function of the energy œ 
relative to the Fermi energy for some value of p. The striking feature in the result is that Im(2) 
is zero for energies within œọ of the Fermi surface. Those electrons outside of this range can 
lose energy by emitting phonons. But those electrons within œ of the Fermi energy cannot 
emit phonons and have a zero value for Im(2). 

The gap in the allowed values of Im(£) can be understood in a simple way. The exci- 
tations with energy œ > @, above the Fermi sea are electrons, or electron-like quasiparticles. 
They can decay by emitting an LO phonon of energy wp. If they initially have an energy œ, 
then after emitting the optical phonon, they have a final energy œ — Wp . The electron states up 
to the energy Er, or (œ = 0), are all occupied, so that the final state must have an energy larger 
than this value. Otherwise the transition cannot occur since the final states are all occupied. 
This reasoning explains why œ — @,) > 0 for electron emission of a phonon, and why 
Im(Z) = 0 for 0 < œ < Wp. Phonon emission by electrons can occur only for initial energies 
at least @) above the Fermi energy. These processes are illustrated qualitatively in Fig. 
7.11(a). The allowed transition has enough initial energy that the final state is above the 
occupied levels. The forbidden transition would go to a state already occupied. 

The behavior for œ < 0 may be understood by realizing that the excitations are not 
electrons. Instead, the excitations below the Fermi energy are states where an electron is 
missing. These are called Holes in a metal. Unfortunately, the word hole in a semiconductor 
usually means an excitation of the valence band. Our word Hole, with a capital “H,” is an 
excitation of the conduction band. It has an electron missing from the otherwise filled Fermi 
sea. These Holes decay by having an electron jump into this empty state, which is illustrated 
in Fig. 7.11(b). The decay of a Hole is allowed whenever the Hole can be filled by an electron 
already present. If the Hole has initial energy © < —@p, then an electron in the Fermi sea can 
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FIGURE 7.10 The imaginary part of the electron self-energy from scattering by optical phonons. The energy œ is 
measured from the Fermi surface. 
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FIGURE 7.11 The explanation of why Im(Z) = 0 for excitation energies within @, of the Fermi surface. (a) 
Electrons with 0 < œ < @, cannot decay since the final state is occupied (this transition is listed as forbidden). (b) For 
—W,) < © < 0, hole states cannot decay since no electrons are available. 


FORBIDDEN 


emit an optical phonon and jump into this Hole state. This process is shown as the “allowed” 
transition in Fig. 7.11(b). The process is equivalent to the Hole jumping to lower energy by an 
amount (, since the zero energy for a Hole is the Fermi energy. The “forbidden” process of 
Hole decay in Fig. 7.11 cannot occur because it requires the initial electron to be above the 
Fermi sea. It is not possible at zero temperature if the system is in equilibrium. The forbidden 
hole decay explains why Im(~) is zero for —@p) < œ < 0. 

These arguments explain the energy gap in Im(2) for —@p < © < Mp. Of course, this 
gap exists only for the self-energy contribution from optical phonons. The electron will have 
nonzero values of Im(X) in this energy region arising from scattering by acoustical phonons 
and from electron—electron interactions. 

The self-energies describe the excitation spectrum of the interacting system. They are 
electron-like quasiparticles for energies above the Fermi energy and Holes below the Fermi 
energy. 

The function g,(Z) in (7.227) is a smooth function of both p and Z. This lack of structure 
is due to the screening of the electron gas, which eliminates all sharp features. Since g,(Z) is 
sufficiently dull, it is a good approximation to treat it as a constant. A simple place to evaluate 
this constant is at Z = 0 and p = 0. This limit defines a coupling constant called fọ: 


3/2 3/2 
ge ge LDH E (ML) 
f = lim tim 7 PO) = Epey (7.229) 
Im[2(@)] = —fplO(@ — 0) — O(—@ — @p)] (7.230) 


The real part of the self-energy may now be obtained by a simple argument. The retarded self- 
energy function X(p, œ) is causal and hence obeys a Kramers—Kronig relation (see Sec. 5.7). 
The real part may be derived from the imaginary part by 


Re[¥(o)] = l do Im[2(0)] (7.231) 
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The result (7.233) was first obtained by Englesberg and Schrieffer (1963). They were actually 
discussing metals with optical phonons, but the result also applies to doped semiconductors. 
The total effect of the screened polar interaction is contained in the coupling constant fọ. The 
same type of answer is obtained for any electron gas with an Einstein phonon, but the detailed 
definition of fọ is altered. A consequence of electron screening is that the details of the 
coupling constant are not terribly important. 

There have been several attempts to verify this theory, in degenerate semiconductors, 
through electron tunneling experiments. Usually the tunneling is done through the Schottky 
barrier at a metal-semiconductor interface. The metal effects are negligible, and all structure 
in the electron tunneling conductance can be attributed to the semiconductor unless the metal 
is superconducting, and then the Bardeen—Cooper-—Schrieffer (BCS) gap is apparent. But for 
normal metals, even if ferromagnetic, the metal-semiconductor junction is determined by the 
properties of the semiconductor (Conley and Mahan, 1967). 

The most direct confirmation of the theory was obtained by Tsui (1974), who measured 
the constant f in InAs as the magnitude of the step increase in the scattering rate of the 
electrons. He was measuring metal—semiconductor junctions in a magnetic field. The 
differential conductance d?I/d?V showed oscillations, similar to the deHaas—van Alphen 


d?I /dv? 


O 20 40 60 
BIAS (mV) 
FIGURE 7.12 d?I/dV? vs V data from an InAs—oxide—Pb junction. The InAs sample has nọ = 5.5 x 10!7 cm~? and 


T = 4.2 K. (a) Taken with H = 22 KG, (b) taken with H = 2 KG, and (c) the difference between (a) and (b). Source: 
Tsui (1974) (used with permission). 
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(dHvA) oscillations in a metal. Tsui’s data is shown in Fig. 7.12. The oscillations change their 
amplitude at voltages greater than 30 meV, which is the energy of the LO phonon in InAs. The 
tunneling voltage is the difference of the two chemical potentials and is the energy in the 
semiconductor which the electrons have after they tunnel from the metal. Using parameters 
for InAs, Tsui used (7.229) to predict a value for tọ = A /(2f) and compared it with his 
experimental number: 


Theory: To = 5.3 x 107” s (7.234) 
Experiment: t = (5.1 + 0.3) x 107” s (7.235) 


There is excellent agreement between theory and experiment. 

In many respects, the degenerate semiconductor is the ideal environment in which to 
study the electron gas and polaron effects in an electron gas. The electron gas can be made to 
have an effective high density at which the theory is supposed to be valid. All parameters of 
the theory, such as « or m*, can be determined by independent experiments in the insulating 
semiconductor. Theory can be done with no adjustable parameters. The experimentalists have 
systems which are relatively easy to measure and which have an electron density which may 
be varied continuously. 


7.4. METALS 


Electron-phonon interactions in metals have been studied for several reasons. The 
phonons in metals are significantly affected by the electron gas, so any study of the phonon 
system requires an understanding of the electron—phonon interaction. The electrons near the 
Fermi surface have their motions influenced by the phonons, so that many experiments which 
measure these electrons, such as transport or cyclotron resonance, are affected by electron— 
phonon interactions. The interaction also causes superconductivity in many metals. There are 
many experimental methods which have been applied to this topic, and ample reasons to do 
sO. 

There is an initial problem facing any theoretical treatment. Since the two systems of 
electrons and phonons affect each other very much, it is not obvious where to begin the many- 
body calculations. Does one first solve the phonons assuming the electrons have no phonon 
influences, or solve for the electron properties using the free propagators for phonons? That 
is, does one first solve for the phonon or the electron self-energies? Neither approach may 
work if the two systems are strongly coupled. Actually, there is a right way to begin, as 
discussed by Migdal (1958). The justification of the procedure requires, to some extent, 
knowing the solution to the problem. One should start by first discussing the phonons using 
electron states which do not contain phonon effects. Although the phonons affect the elec- 
trons, they alter electron states only within a Debye energy of the Fermi surface. These states 
are only a small fraction of the electrons in the system. However, in solving for the electrons’ 
influence upon the phonons, the average is taken over all the occupied states of the electron 
gas. This average is influenced to a negligible fashion by the few electrons at the Fermi 
surface, which themselves are influenced by phonons. A good answer is obtained by first 
solving for the phonon states using electron states which have no phonons included. Then one 
can solve for the electron states using these renormalized phonon states. The standard 
procedure is to first solve for the properties of the phonons. An electron Green’s function is 
employed which does not have a self-energy contribution from phonons. But it does contain 
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the effects of electron—electron interactions, which are very important for obtaining the 
correct physical description. 


7.4.1. Phonons in Metals 


For simplicity, the discussion is limited to metals, such as sodium or aluminum, which 
have only one atom per unit cell. The many-body system is a collection of ions and electrons. 
It is charge neutral, so the ions and electrons have the same average charge density 
|e|%) = eZn;, where nọ and n; are the particle densities of electrons and ions. The ions of 
valence Z are considered as rigid objects. The excitations of the inner core electrons can be 
neglected, since these energetic events are not induced by the motion of the ions. The core 
excitations will contribute a small amount of dielectric screening, which can be included as a 
phenomenological high-frequency dielectric constant ¢,. The Hamiltonian was discussed 
earlier in Sec. 1.3: 


H = pe 


SD 


25 Ir, _ r | + Ve —R,) + 22 ViR, Rg) (7.236) 


The potential V,,(r; — R,) is between an electron at r; and an ion at R,. It was called V} in 
Sec. 1.3. Similarly, the potential V;,(R, — Rg) is between two ions. Both of these potentials 
are unscreened, so they behave at large distances as 
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In a free-electron metal such as sodium or aluminum, the ions are sufficiently far apart that the 
Coulomb form (7.238) of V; is probably valid for all ion pairs. However, the electron-ion 
interaction V,, must always be treated realistically at small distances. The customary proce- 
dure is to use a pseudopotential (Harrison, 1966). The electron—electron interactions are also 
included in the Hamiltonian. They cause the potentials to be screened. 

The first discussion is at an introductory level, approximately the same as Schrieffer 
(1964). In this model the ions are point charges, and the electron gas is jellium. Later real 
metal effects are discussed at a more advanced level. The introductory treatment will try to 
explain the physics and avoid the bewildering notation of the correct treatment. 

The first step is to solve the Hamiltonian of the electrons in the absence of phonons. The 
ions are fixed in their equilibrium positions R®. Then (7.236) becomes the Hamiltonian of 
electrons in a periodic, neutral system: 


Hoe = Soma T. =t V(r, RP) 420 VRO RP) (7.239) 
IF] 


The solution of this problem is quite formidable. The electrons states are Bloch waves in the 
periodic potential. They must be calculated with the full effects of correlation and exchange in 
the metallic environment, which has only been done approximately. The problem is quite 
difficult, but it is not the problem at hand. For the discussion of electron-phonon effects, it is 
assumed that the electron part of the Hamiltonian has been solved. Furthermore, in this 
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introductory treatment, the electron states of our quasi-jellium model of a solid are 
approximated as plane waves. 

The Hamiltonian of the phonons is first solved without reference to the electrons. In the 
harmonic approximation (see Sec. 1.1) the ions are assumed to have small displacements Q, 
about their equilibrium positions R®, so the Hamiltonian can be written as 


R, =R? +Q, (7.240) 

H = Ho, + Hop + Hey (7.241) 
Hy = Spt} Q- QR —RP) (1.242) 
Hep = 2 Q, vre — R®) (7.243) 
DR) = VV VCR) (7.244) 


The term Ho, is called the bare-phonon Hamiltonian. These phonons are calculated by 
ignoring the electron—electron and electron-phonon interactions. The ion motions are 
calculated using only the direct ion—ion interaction potential V;. The bare-phonon Hamil- 
tonian, once solved, describes a set of eigenfrequencies and normal modes which are quite 
unrealistic. The poor results are obtained because the phonon Hamiltonian is for a system 
which is not charge neutral. When the ions vibrate, the bare-phonon Hamiltonian Hp, treats 
the electron gas as a rigid background, which does not follow the ion motion. Hence the ion 
vibrations cause long-range Coulomb fields. These vibrations are just plasma oscillations, of 
the ions, so the long-wavelength phonons will have the frequency 


4nZ*e*n, 
o2 = 24 e ni (7.245) 
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In usual solids, and metals, the long-wavelength oscillations of the atoms or ions are 
described by acoustical phonons, which have the dispersion relation œ(q) = C,q proportional 
to the wave vector, where C, is the speed of sound. In solids with more than one atom per unit 
cell there may also be optical phonons, which have a constant frequency at long wavelength. 
However, the above plasma modes of Hop are not from optical modes of vibration, since the 
present theory is for solids with only one atom per cell and no optical modes. In fact, the bare- 
phonon Hamiltonian Ho, is sufficiently unrealistic that it does not have acoustical mode 
solutions, which are necessary for real metals. The ion plasma modes are the correct long- 
wavelength normal modes of Ho, but do not realistically approximate the actual modes in a 
metal. 

In normal metals, the electrons and ions respond differently because of the great 
difference in their masses. When the ions vibrate, the electrons follow the ion motion. There 
are no long-range Coulomb fields and no ion plasma waves. When the electrons oscillate at 
their natural frequency œ, the ions are too heavy to follow, so the electron motion is not 
screened. The electrons can freely oscillate at their plasma frequency, while the ions cannot. 
Our bare-phonon Hamiltonian gives the wrong answer because it did not let the electrons 
follow the ion motion. Permitting the electrons to follow the ion motion is another set of 
words to describe the screening by the electrons of the motion of the ions. 
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The method of solving (7.242) for the bare phonons was described in Sec. 1.1. The ion 
displacements Q, and conjugate momenta P, are expanded in a set of normal modes, 


Q, = aK 2 Qge™ P (7.246) 

P, = = 2 P e™ ® (7.247) 
Hop = Dla P,-P_,+ 14,0 106,09 (7.248) 
,,,(k) = B(0) — ®(k) (7.249) 
®(k) = 3 ek R'® (RO) (7.250) 


If the ions were point charges, these terms would be 
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where G are the reciprocal lattice vectors for the crystal. The summation over G converges 


slowly, which may be helped by using Ewald techniques. The normal modes of the bare- 
phonon system are found as the eigenfrequencies Q,, and eigenstates €,, of the equation 


det| M245, — Pull = 0 (7.254) 


These frequencies and eigenstates are used to define a set of creation and destruction 
operators through the expansion in Eq. (1.85): 


| 1/2 
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where p is the density. The formal solution of the bare-phonon system is complete. The 
modes which are obtained are conceptually useful, but there is no need to calculate them. The 
set of eigenfrequencies and eigenstates are only used as the basis for solving the electron—ion 
interaction H,,. When solving the interacting system, a new equation is obtained for the 
phonon modes. These new equations contain electron screening and provide a realistic 
description of the atomic motions. These are worth solving numerically. The eigenstates of the 
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bare-phonon system are useful only as a concept upon which to build our solutions in terms 
of Green’s functions. The interaction term between the electron and phonons is written as 


1 
Hey == My (a+ Get A (7.258) 
V qàG 
z 1/2 
M, (q + G) = (sn ba * (a + G)V.i(q + G) (7.259) 
P gÀ 


The phonon states in this basis are the bare phonons. It is an unrealistic basis but serves only 
as the starting point for the Green’s function calculation. The electron states are put into the 
second quantized notation, so that the effective Familtonian has the form 


H = L Cro Cro + Fant ae 5È 2 Va Cha, Cp qs Cps'Ck,s 
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The solution to this Hamiltonian was discussed in Sec. 7.3. The present problem corresponds 
to the case where the electron plasma frequency œ, is very much larger than the phonon 
frequencies. In this case the effective interaction between two electrons may be written as 
(7.209) with a screened Coulomb interaction and a screened phonon interaction: 


Vg 
Vf (q, iœ) = e(a, ia) + MDA. iœ) (7.261) 
e(q, iœ) = 1 — “P(g, iw) (7.262) 

. g 
Dy (q, io) = 1 Meo (7.263) 


where M, is the screened matrix element M = M / E. ghe quantity 2, is the phonon Green’s 
function. Its denominator contains the factor M2 DV )P/e. The term 1/e comes from the 
screening of the electron gas. It describes how the electrons can follow the motion of the ions 
and hence screen out the long-range Coulomb forces. The screening eliminates the ion- 
plasma solutions and makes the modes acoustical. To show this, first rewrite the phonon 
Green’s function as 
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At long wavelength, the factors in the denominator of this Green’s function have the following 
limits: 


Dret(G, ©) = (7.264) 


Qn —> Win 
h AnZe? 
M, >q 5 
li rey ae (7.265) 
ım U . 
q>0 20 „M? —> 1 op 


486 Chap. 7 e Electron—Phonon Interaction 


The last term on the right shows that the phonon energy denominator becomes the simple 
expression ©” — @;,/e. For the small frequencies of phonons, the static dielectric function 
€(q, 0) may be used. In the limit q — 0 it is given by the compressibility sum rule (5.266). 
The zero of the phonon energy denominator defines the new phonon modes œ? (q) = OA /€(q) 


lim (q) = qC, (7.266) 
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The long-wavelength excitation of the coupled system of ions and electrons are now 
acoustical phonons. The speed of sound is given by C,. The formula for C, is similar to the 
result first obtained by Bohm and Staver (1951), who found that C, = 0,,/q7r = 
vp»/Zm/3M. These ordinary sound waves are obtained because the electron gas follows the 
vibrations of the ions. When the ion of charge Ze moves, an amount of screening charge 
nearly —Ze follows it, so the vibrating entity is nearly neutral. No long-range Coulomb fields 
are present, and there are no waves at the plasma frequency of the ions. The screening charge 
has contributions from all electrons in the Fermi sea. 

The situations in metals is quite analogous to that of semiconductors. There the electron— 
electron interactions caused the LO phonon modes to be given by 


2 2, Mo ~ 07o 
a” = O70 + (9) (7.268) 
To apply this formula to metals, set ©zo = Op and ®79 = 0. The TO frequency is zero in 
metals since there is no restoring force for transverse oscillations. These steps give the same 
formula w* = «;,/e(q). 

The phonon Green’s function can be written approximately as 

D, (q, ©) = on (7.269) 
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This approximation is valid only when the damping of the phonon states can be ignored. The 
phonon eigenfrequencies ,(q) are the ones found with the inclusion of electron—electron 
interactions and represent the actual phonon modes measured in the solid. The electron- 
phonon matrix element Mi, in (7.258) contains the factor Q. In the screened phonon 
interaction, the unphysical frequency Q,, completely cancels from the product Mi D;,(q, ©), 
and the final formula contains no reference to 4, whatsoever. 

The preceding theory was much too simple to find realistic phonon modes in actual 
metals. The arguments applied only to longitudinal modes of vibration and only at long 
wavelength. A better description is needed to calculate transverse modes. Dyson’s equation 
for phonons is really a matrix equation, so that the phonon energy denominator is not just a 
scalar quantity but is derived from the determinant of a matrix. 

The reader is referred to Joshi and Rajagopal (1968) for a thorough treatment of phonons 
in metals. They suggest one method of using the dielectric function for the homogeneous 
electron gas to calculate realistic phonon modes for metals. If the Fourier transforms of V,,(r) 
and V,,(r) are V,,(q) and V,,(q), the screened interaction V,,(q) potential between ions is given 
by the effective potential: 

2 P(q) 


V.(q) = V;(q) + Val) eq) (7.270) 
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The first term is from the direct interaction between ions. The second term is from the 
interaction mediated by the electrons. The latter term arises when the ions polarize the 
electrons, and this polarization acts upon other ions. The motivation for choosing this 
interaction comes from the form of the denominator of the phonon’s Green’s function, whose 
poles predict that the phonon frequencies are given by 


MyP@) 
e(q) 


The first term on the right, %> is roughly given by ,,(9)/M ~ quay V,(q)/p. The second 
term on the right is 


(7.271) 


M3 P (q) _f 7 P(q) 
20,, “= + 7.272) 
aO p Ouo l 
The two terms combined have the form 
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The term in brackets is just the effective interaction in (7.270). The Green’s function deri- 
vation has indicated this form to be the appropriate one for the potential between two ions. 
The effective interaction will be used to solve for the phonon modes in metals. These modes 
are found by solving a determinantal equation. For real metals it is not a good approximation 
to express the screened ion potential as V,,(q)/e(q). 

The phonon modes may be obtained in the same fashion as described in (7.249) and 
(7.254). A dynamical matrix is constructed from the screened ion potential ,,(q) which is 
used in the determinantal equation (7.254). That procedure was followed by Woll and Kohn 
(1962) to calculate phonon modes in aluminum. Other calculations are described by Ziman 
(1960), Sham and Ziman (1963), and Harrison (1966). Other calculations are Na and K by 
Shukla and Taylor (1976); K, Sn, and Sb by Kay and Reissland (1976); In by Garrett and 
Swihart (1976); and Na, K, and Rb by Srivastava and Singh (1976). 

A significant feature of phonon modes in metals is the Kohn anomaly (1959). In the 
RPA, the electron polarization operator for œ = 0 is (x = q/2kp) 


P(g) = — me Í + q — x’) In|; A +3 


| (7.274) 
— xX 


In the equations for œ? (q), from P® there appears the factor 
(q? — 4k}) In(q — 2kp) (7.275) 


which is zero for q = 2kp but has an infinite slope. The phonon modes have a dispersion 
dœ, (q)/dq which is logarithmically divergent at q = 2kp, which is the anomaly. 


7.4.2. Electron Self-Energies 


The electron-phonon interactions in metals have an important influence upon electron 
states near the Fermi energy. The energy scale is set by the phonon energies themselves and 
so is roughly a Debye energy. It is only 20-30 meV in many metals, compared to Fermi 
degeneracies Ep of several electron volts. The actual self-energy X(k, u) of electrons from 
electron-phonon interactions is a small energy, which is negligible compared to Ep. However, 
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its derivative dX(k, u)/du is large, so that it makes a large contribution to the electron 
effective mass. Exactly the opposite behavior was found earlier in Chapter 5 for the electron 
self-energies from electron—electron interactions: There the energies are large, but the 
contributions to the effective mass are small. 

For the electron-phonon interaction, it is now assumed that the phonon states have been 
determined satisfactorily, perhaps by the methods already discussed. Having started with a set 
of bare phonons with frequency Q,, and matrix element M,, the actual phonon states in the 
metal have been determined by numerical means. There is available a new set of phonon 
frequencies @,(q), eigenvectors, and matrix element M,, for electron-phonon interaction, 
where A is the mode index: TA, LA, etc. The matrix element is obtained as follows: the 
screened electron—phonon interaction (7.209) is rewritten in terms of the new matrix elements 
and frequencies: 


M, (q) 204, 7276 

eo) = oa. 0)* @? — Ga — 2Q4,P(q)M2/e(q) (7:276) 
20,(q)  _ 72.0) FO) 

= EMD Soa AODA o) (7.277) 


The frequency dependence in P(q, œ) and &(q, œ) is ignored since it has no influence at small 
frequencies. The preceding equation serves to define the renormalized matrix element from 


(7.258), 
,\ i? P 2. EA 
= ° el 7.278 
=) (sna) 9° Sa. Sq) (7.278) 


_ Q) 
it (q) = M,(q) ( 


e(q) 
which is expressed in terms of the renormalized frequencies œ, (q). This renormalized version 


of the screened electron-phonon interaction is exactly what we would obtain from a 
Hamiltonian of the form 


M,(q) 
H = ECs Co +O aay + a Ag > CeasC (7.279) 
po qh po 


Equation (7.279) is just a simple electron-phonon Hamiltonian using the renormalized 
phonon frequencies and matrix elements. Its significant feature is that electron—electron 
interactions are omitted. When they are included, they serve to screen the electron-phonon 
interactions. This phenomenon was used to derive the renormalized frequencies and matrix 
elements. Adding electron—electron interactions again would be double counting. It would 
give a second renormalization which is incorrect. Use the effective Hamiltonian (7.279), with 
renormalized frequencies, and do not include screening again. 

The self-energy of the electron, from electron—phonon interactions, is calculated using 
the renormalized phonons and matrix elements. The only diagram which is ever calculated is 
the one-phonon bubble diagram: 


1 - _ 
Z(k, ky) = -7> Yo MAD ZP + q, ik, + i@p,)D,(q, ion) 


qa, iO, 


2@,,(q) l 
ojis On) M P + 0,(@ ik +10, h (7.280) 
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The one-phonon self-energy makes only a small contribution to the energy of the electron. 
Higher-order terms, involving two or more phonons, are neglected on the basis that their 
contribution is even smaller, as first shown by Migdal (1958). The one-phonon self-energy 
diagram is evaluated by making several approximations. The free-particle propagator G is 
used for the electron Green’s function. The electron-phonon matrix elements M,(q) are 
treated as functions only of q, not of œ. The frequency dependence is ignored in the long- 
itudinal dielectric function e(q, im). With these approximations, the Matsubara summation is 
the same one which has been evaluated previously in Sec. 3.5 for unperturbed electron and 
phonon Green’s functions: 


E(k, ih) =D | <4 d4 


) ee +1—AplEgyg)  Mg(@,) + 2 (Extg) 
(2n eM 


EF or 281 
ik, — Čk+q — (q) ik, a Ck+q + A g ) 


The retarded function is obtained by ik, > u+ ið. The discussion is confined to zero- 
temperature properties, so set ng = 0. Consider the expression 


1 — nr(Ex+q) Nr(Sk+q) 


X(k, u) = ojas On M,(q) | Te u Etola a (7.282) 


The self-energy is customarily evaluated using the Brillouin—Wigner form of perturbation 
theory. The energy u is retained as a separate parameter, and the self-energy is calculated as a 
function of both u and k. 

The variation of X(k, u) with respect to u is far more important than with respect to k. 
The variation with respect to u is on the scale of a phonon energy, while the variation with 
respect to k is on the scale of kp. An order of magnitude estimate of these derivatives is 


a » du >» 
3> (7.283) 
Ou Op ob, ~ Ep 


where œp is the Debye energy of the solid. The derivative of 2(k, u) with respect to u is larger 
than the derivative with respect to €, by the factor E;/@p ~ 107. It is usually possible to 
neglect 02/0,, so that the effective mass and renormalization factors are given by the same 
expression: 


(™) =z, = ( _ >J (7.284) 


m* 


The electron self-energy X(k, u) from electron-phonon interactions has appreciable value 
only for u within a Debye energy of the Fermi surface, or —@p < u < @p. In this narrow 
energy range, k hardly changes from kp. For metals with a spherical Fermi surface, the k 
dependence of X(k, u) is unimportant, and the k dependence is often even suppressed in the 
notation: X(u) = X(kp, u). When the metal has its Fermi sphere cut by Bragg planes, so that 
the Fermi surface is divided into pockets, the self-energy may depend on the position on the 
Fermi surface in the region near these Bragg planes. In that case one should retain the k 
dependence. 
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The two terms in the integrand of (7.282) which contain the factor np are far more 
important than the factor which contains the 1. It is natural to separate È into two terms: 


Y= 29 4 50 (7.285) 


5@(k, u) = >| st se Cura) (a) 


1 1 
— — .286 
* È En F ua F s (7.286) 


d’ 1 
S a 
A 


(2T M (0? u — Ek+q — (,(q) + 10 (7.287) 


The term &) is a rather dull function of its two arguments. Not only is this self-energy term 
small in magnitude but so are its derivatives with respect to k and u. It makes a negligible 
contribution to the energy, effective mass, or other properties of the electron. An estimate of 
this term can be obtained in the following way. Since the chemical potential does not enter the 
evaluation of this self-energy, it behaves as the self-energies calculated for one particle in an 
empty band. It contributes to the self-energy for states near to the bottom of the occupied 
band, but is quite small for electrons near to the Fermi energy. Only the latter electrons matter, 
so Z® can be neglected. 

The other self-energy term, = in (7.286), is also rather small in value. Its importance 
comes from its contribution to the electron effective mass. The self-energy has a strong 
dependence upon energy for u ~ 0, which is near the Fermi energy. Consider the real part of 
dx” /ðu 


re(=-) = = -Djé znr (Ék) Ma) o (7.288) 
du J u= (27) a [E k+ — %0]  [Sk+q + (0) 


The integral is evaluated by integrating by parts on the angular variable v = cos0 = p - q, 
where the two factors are 


|ua = uv — [oau (7.289) 


m 7 _ 
v= ~ eg Crha + @,) E dv = AV(Ek+q = @) í 


d 
u = nr (k+): du = ay É nC) 


The integration by parts gives, with d*>q = 2nq*dqdv 


gy @ da iy | = 


l d l l 
— dv| — o l 
1 { JE ne(Cxsa)) or ktq — œ (q) & +q + z) (7.290) 
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The first term in (7.290) is smaller than the second by a factor œp/Epr and may be neglected. 
The second term is the important one. The factor dn;-(&)/d& = —6(&) at zero temperature. 
Changing the integration variable to p = k + q gives, for this last term alone, 


ax 7 M; (k — př 
Re( ðu > ae. ojas On P o kp) me) 
___* dp M,(k — py” 
7 (2n)° “x | vr @,(k — p) (7:292) 


where d?p = 2np*dpdv and f dvd(E) = 1/(pvp), which leaves d*p = 2npdp. The remaining 
integration is over the two-dimensional area of the Fermi surface. For spherical Fermi 
surfaces, this factor is d*p/up = mk-dQ,, where Q, is solid angle, but most Fermi surfaces 
are not spherical. The term [3 Re © Jdu] is quite large because of the factor wy’, since 
phonon energies are small. This important expression was first derived by Nakajima and 


Watabe (1963). It is called A, 


D pele?) 22 [CP Mak -p 
Mk) = -Re( z; ) = On >| bp o kp) (7.293) 


which is somewhat unfortunate, since À is also used to designate the summation over the 
polarization modes of the phonon system. No confusion should arise over these quite different 
parameters, however. 

The parameter A(R), as defined in (7.293), is a function of the position on the Fermi 
surface of the metal. The point k is the reference point, and the average is taken over other 
points p on the Fermi surface. Quite often the average value of À is given for a metal, which is 
obtained by averaging Mk) over the Fermi surface: 


dkak 


The other contribution to the effective mass of a particle at the Fermi surface is from the 
derivative of X(k, u) with respect to €,. However, £ /3%, is much smaller than À by a factor 
Wp/Ep, so that the most significant term is à. The other terms are usually entirely neglected, 


and the effective mass of electrons at the Fermi energy, from electron—phonon contributions, 
is given by 


(=) 14K (7.295) 
m 


The first calculations of this quantity were by Swihart et al. (1965), and by Ashcroft and 
Wilkins (1965), and many have been done since. The quantity à can be determined from 
electron tunneling experiments, so that experimental values are available. Both theory and 
experiment have been reviewed by Grimvall (1981). His suggested values are shown in Table 
7.2 for some nontransition metals. Some of the values are quite large—they are 1.6 and 1.5 
for mercury and lead. These large values show that the electron—phonon coupling is large for 
these metals. This observation is significant for the theory of superconductivity, which in 
elemental metals is due to electron-phonon coupling as described in Chapter 10. Metals 
which have a low value of A, such as the alkali metals, are not superconducting. 
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Table 7.2 Electron phonon mass enhancement À. 


Li Be 
0.41 £0.15 0.24 + 0.05 
Na Mg — Al 
0.16+0.04 0.35 + 0.04 0.43 + 0.05 
K Zn Ga 
0.13 + 0.03 0.37 + 0.05 0.97 + 0.05 
Rb Cd In Sn 
0.16 + 0.04 0.40 + 0.05 0.8 0.72 
Cs Hg Tl Pb 
0.15 + 0.04 1.6 0.8 1.5 
From Grimvall (1981). 
The phonon density of states is defined to be 
Lq 
Fo =E |o- oA (7.296) 
x J (27) 


where the summation À is over different modes, and the integration is over the Brillouin zone. 
McMillan (1968) has introduced another function, which is called «?F(o). It is treated as a 
single function, although the notation implies incorrectly that it is the product of two func- 
tions. It is defined as 


1 fdp- 
04, F() = Sap | Geek — p) 8[o — @,(k — p)] (7.297) 
Mk) = 2 | — ah F (0) (7.298) 


a? F(w) is the frequency spectrum one obtains by starting at a point on the Fermi surface k 
and integrating over all other points on the Fermi surface p. It will vary from point to point on 
the Fermi surface of a metal. The same quantity without the subscript k is the average over all 
different points on the Fermi surface: 


x F(o) = [Zar (œ) / [Z (7.299) 


Nowadays it is calculated with a computer, using realistic electron wavefunctions, realistic 
phonon eigenstates and frequencies, and realistic matrix elements. The quantity «*F(w) can 
be obtained experimentally, as will be explained in Chapter 10, from electron tunneling 
experiments. As an example, «?F (œ) for Pb and Sn are given in Fig. 7.13. These results are 
from the experimental data of Rowell et al. (1969a, 1971). The dashed line in Fig. 7.13(a) is 
F(@) for Pb, which is similar to &?F (œ) because the matrix elements vary only by about a 
factor of 2 over the various phonon states. Other physical quantities may be calculated from 
«’F(w). For example, the lifetime of electrons on the Fermi surface is 


l D l 1 
— = 4 dox F —— + ———— . 
K) T | wo, F'(o) (=e Ti + JoL r) (7.300) 
Note that 1/t(k) is zero at zero temperature but is nonzero for a nonzero temperature. This 
lifetime is between scatterings of the electron by phonon emission or absorption. It is not the 
same lifetime which enters a calculation of the resistivity, for example. These differences are 
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FIGURE 7.13 The energy dependence of the electron-phonon coupling «?F (œ) in (a) lead and (b) tin as determined 


by electron tunneling in superconductors. In (a) the dashed line is the phonon density of state F(œ) alone. In (b) the 
points indicate the spread of values for different tunnel junctions. Source: Rowell et al. (1969a, 1971) (used with 


permission). 
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treated in Chapter 8. The electron lifetime in (7.300) is derived from the imaginary part of the 
retarded self-energy in (7.281): 


Lq 
= — 2Im[X(k, u = 0)] -ay |a $ M, (Q? 


x {[1 + g(@,) — nr (Ek+q)]O(Sx+q + ©1(0)) 
+ [n8 (0) + Me(Ex4q)15(Sk-+q)]5(Sx4q — 0(0))) (7.301) 


l 
(Kk) 


Change variables to p = k + q, and the integral is 


wn mY | <2, ag Mal — PY (nolo (k + p)) + nro — P)) 


x {5(E, — @,(k — p)) + 0(G, — @ (k — p))} (7.302) 
The phonon energy can be neglected in the delta functions 6(€, + @,(k — p)) since their 


energy is small compared the the Fermi energy. Then the two delta functions are alike, and 
each just scatter the electron to another point in the Fermi surface 


1 
(k) an 


pje P it (k — pinalo) + nelo) (7.303) 


which is the same as (7.300). Results for aluminum and lead have been reported by 
Tomlinson and Carbotte (1976). The same calculation for nonzero u gives 


1 


"PE —2 Im[X(k, u)] (7.304) 


= 27 N dox F(o)[2ng(@) + np(@ + u) + nr (© — u)] (7.305) 
0 


Equations (7.300) and (7.305) are the formulas for calculating Im(®). 
Next a formula is needed for Re[X(k, u)]. From (7.286), integrate by parts on the 
v = cos 9 variable in the same fashion as was done in deriving (7.290): 


l l 
J Dne Eg FO) u Ee =a 


u— 6% (q) = 
m + 
— = Nr(Sx+q) In zE o i 


kq Ék+q — (q) v=- 
l dne(S) — Sxig + ©(q) 
d — ktg as 
+ F . de “fy — Exig — (q) 
~ u+,(q)| f 
x —In u — œ (q) |. AVO(Sk+q) (7.306) 
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FIGURE 7.14 The real and imaginary self-energies at zero temperature of an electron in lead, from the electron- 
phonon interaction. Source: Grimvall (1981) (used with permission). 


The term dn,;(&)/d& = —6(E) is the important one, and the other may be neglected in 
comparison. A careful analysis shows it to be smaller by the factor op /E;. The result for zero 
temperature is: 


u + @(k — p) 


d*p - 2 
Re[X(k, y] = -5 = O(Sp)M,(k — py In u — œ (k — p) 


x J (2ny° 


= -| doxk (œ) In a= 


(7.307) 


This expression vanishes at u = 0 and is positive for u < 0 and negative for u > 0. The usual 
procedure is to find «?F (œ) from superconducting tunneling experiments. It is a property of 
both the normal metal and the superconductor. It is used in (7.305) and (7.307) to find the 
electron self-energy from electron-phonon interactions. Corresponding experimental infor- 
mation may be obtained from electron tunneling in normal metals, as shown by Rowell et al. 
(1969b). Their results for Pb are shown in Fig. 7.14. The self-energy has no k dependence 
since they use an averaged value of «?F(w). Re[X(u)] is quite small—of the order of milli- 
electron volts. However, it has a very steep slope at point u = 0, which gives d/du its large 
value. 


PROBLEMS 


1. The polaron self-energy —«@,) contains some kinetic energy of the electron and some potential 
energy. Use the Rayleigh—Schrodinger wave function to show these have the ratio of 1:3 at p = 0. 


2. Show that the total momentum of the polaron in (7.18) commutes with the Hamiltonian in (7.2). 


3. Calculate m/m* for the Fröhlich polaron in the Tamm—Dancoff approximation. 


496 Chap. 7 e Electron—Phonon Interaction 


4. Use the two-phonon wave function, in Rayleigh—Schrédinger perturbation theory, to find the « 
term for the number of phonons in the polaron cloud at p = 0. Be sure your wave function is normalized 
to unity. You should get —0.1036«*, so correlations reduce the number of phonons in the cloud. 


5. Calculate Xps(p) at nonzero temperature for all values of €,. Also calculate its imaginary part, 
which is the imaginary part of the retarded self-energy with E = €,, which is also half the mean-free- 
path for one-phonon emissions or absorptions. 


6. Use the answer of Problem 5 to plot, on a piece of graph paper, Las(e,) for « = 1 and 0 < £, < @. 
Evaluate the self-energy in the Tamm—Dancoff approximation for the same parameters, and plot it on the 
same set of graph paper. 


7. Consider the one-phonon self-energy in Rayleigh—Schrédinger perturbation theory at nonzero 


temperatures: 
d? N, +1 N, 
Lrs(P) = [<4 M; | —+——_ + —__+—_ (7.308) 
(27) Ep — Epig T Oy Ep — Ep+q T Oy 
At high temperatures, make the approximation N, œ kgT/@,, and keep only the term proportional to 
kT. Evaluate it by assuming a Debye spectrum and a piezoelectric matrix element (1.256). You should 
find that the self-energy vanishes when v, is less than the speed of sound. 


8. Minimize the ground state energy, in strong coupling theory, with the variational wave function 
(r) = A exp(—fr). Show that the energy is —0.0980«7« , which is not as low as found with a Gaussian 
wave function. 


9. In strong coupling theory, calculate the energy of the lowest p state by using the variational wave 
function (r) = Ar cos(8) exp(—B’r?), where B is a variational parameter. 


10. In the limit of small wave vectors, the interaction between an electron and the homogeneous 
electron gas has the form 


2 
Vg G — 1) _, a" (7.309) 


The electron—plasmon interaction is identical in form to the Frohlich polaron Hamiltonian. What is the 
equivalent « for the electron gas? Find the numerical values for metallic sodium (r, = 3.96) and 
aluminum (7, = 2.07). 


11. Consider a small-polaron Hamiltonian which has the phonons localized on each site. This model 
could apply to molecular crystals, where the molecular vibrations are highly localized: 


H = gjs 2 CÌ C; + @pala; + MOC] C;(a; + a)| (7.310) 


Show that this may be derived from (7.95) with the choice M, = constant. Find the canonical 
transformation which diagonalizes the last two terms in the Hamiltonian. Use the transformed 
Hamiltonian to calculate: 


(a) The thermally averaged amplitude and probability that a hop occurs without any change in the 
number of phonons at each site. 

(b) The thermally averaged amplitude and probability that the hop occurs without any change in 
the number of phonons but that they may be exchanged between the two sites. 
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12. Show that the effective interaction (7.173) with the dielectric function (7.174) may be written as 


E 1 Q@y 1 i 
Vet = "4! = 5B Oy a Neng 0,P e ~0,P 
œ — Uy (q) —@* \E — VP & — vP 


where Q(q) is the renormalized phonon modes of (7.220). At œ = 0, then Vefe is v,/(E9 — v,P), and at 
© — 00 it is v,/(E. — VP) (Appel, 1966). 


13. Prove that the following two expressions are identical for polar scattering by optical phonons 


Vig = 2 = 
oF total | — WP 


(7.311) 
where £ota 1S given in (7.173) and W is given in (7.194). 


14. The Raman scattering of Fig. 7.11 occurs because the light couples nonlinearly to the phonons but 
not to the plasmons. Assume a term in the Hamiltonian of the form 


IY [BL by Ay 4 + he] (7.312) 
kk’ 
which describes the scattering of light from k’ —> k with photon operators bi by and phonon operators 
Ay = dq + aq. 
q q q 


(a) Derive a correlation function which describes the Raman scattering to order 7°. 
(b) Solve it at long wavelengths to get the Raman intensity of the modes œ, in (7.214) as a 
function of ng. 


15. Compare Rayleigh—-Schrédinger and Brillouin—Wigner perturbation theory by plotting both 
theories using the self-energy in (7.233): 


RS: Ep) =, +Re(X(E,)] (7.313) 
BW:  E(p) =&, + Re(Z[E(P)} (7.314) 


Use the parameters fọ = 10 meV and @) = 30 meV. Which theory appears more reasonable? 


16. Use the RPA for €,(qg, œ) to estimate the damping of phonons from electron-hole creation in a 
metal. 


17. Prove that the sum rule f dowe?F (œ) is independent of phonon properties (McMillan, 1968). 
Evaluate the sum rule analytically for metallic hydrogen, and estimate the formula numerically. 


18. Derive Eq. (7.305) for t(k, u)'. Show that it goes to a constant at u —> ©. 


19. Use (7.305) and (7.307) to show at zero temperature that Re[X(u)| and Im[2(u)] obey a Kramers- 
Kronig relation. Calculate by direct integration of (5.297) and (5.298). 


20. Show that in the jellium model of a metal, with the ions as point charges, the longitudinal modes at 
long wavelength are described by a total dielectric function [see (7.174)]: 


o; 
Erotal = & | 1 — =? — v,P(q, ©) (7.315) 


With this choice, one can write the effective interaction in (7.173) as the correct electron—electron and 
electron—phonon parts. 
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21. If X(7,k, &) is the retarded self-energy at temperature T, show that the electron self-energy in a 
metal from electron—phonon interactions obeys 
dn F (£^) 
ds! 


Re[X(7, k, &)] = | ae] frezo, k,e—e')] (7.316) 


22. Use (7.300) to show that at high temperatures 1/t = 20Ak,T. 


Chapter 8 
dc Conductivities 


8.1. ELECTRON SCATTERING BY IMPURITIES 


This chapter is mostly concerned with the methods for calculating the electrical conductivity. 
Four different methods are discussed: (1) solving the Boltzmann equation, (2) evaluating the 
Kubo formula for the current—current correlation function, (3) evaluating the force—force 
correlation function, and (4) solving the quantum Boltzmann equation. For scattering from 
fixed impurities they all give the same answer. For scattering by phonons two different 
answers are obtained. One is called the Ziman (1960) formula, and the other the Holstein 
(1964) formula. Two criteria are important in comparing these methods: which is the easiest 
to use, and which gives the most accurate answer? 

The electrical resistivity, or conductivity, from impurity scattering is an important topic. 
From the experimental viewpoint, all solids have impurities which make a contribution to the 
total resistivity. In many metals and semiconductors, the low-temperature resistivity is 
dominated by impurities, since all other contributions are temperature dependent and vanish 
at low temperature. In metals, the resistivity from impurity scattering is largely temperature 
independent, except for temperature variations on the scale of the Fermi temperature 
Tr = E;-/kg. The subject is important, on the theoretical side, because it was one of the 
earliest evaluations of the Kubo formula. The importance of vertex corrections became 
apparent. Indeed, the derivation showed that vertex corrections are usually very important and 
should be assumed important until shown otherwise. This conclusion, and message, continues 
to be relevant even for calculations of other quantities. 

The present chapter is really about vertex corrections. 


8.1.1. Boltzmann Equation 


The electrical resistivity from impurity scattering is easily derived by using the Boltz- 
mann equation. This derivation is presented for several reasons. First, it is the easiest way to 
get the answer. Second, the resistivity was first found this way; the Green’s function evaluation 
of the Kubo formula only confirmed the result known earlier from transport theory. The 
Green’s function derivation is complicated and subtle, and it is useful to know and believe the 
right answer in order to recognize it when it is finally derived. 
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Our objective is to derive a formula for the electrical resistivity with the least possible 
fuss. The simplest possible model is adopted for the solid. It is a homogeneous system except 
for randomly located impurities. The electron states are plane waves except for occasional 
scattering from isolated impurities. The impurities are very dilute, so that interference 
between successive scatterings can be neglected. In the Boltzmann theory, the electrons are 
described by a classical distribution function f(r, k, ^. The time rate of change of this 
distribution function is governed by the Boltmann equation 


af _ of 


df 
7 dt at a acl +a Vis + (5; Ja 6.) 


dt 


The last term is the time rate of change due to collisions with the impurities. There is no r 
dependence in f(r, k, A since the material is assumed to be homogeneous. Also, for the dc 
conductivity, there is no time dependence. The system has a weak external electric field and 
the current flows in a steady-state fashion. The distribution function is only a function of wave 
vector f(k) and obeys the equation 


ak df 
8.2 
o= ðt Vi f + E ) collisions l ) 


In a solid, the factor dk/dt is equivalent to an acceleration which is equal to the forces on the 
electron (see Kittel, 1963): 


— = —eK — =v x Hy (8.3) 


In the present problem, there is an electric field E and no magnetic field Hp = 0, so that 
d 
eE: V, f(k) = (4 L) (8.4) 
dt collisions 


The collision term is the most interesting. It is evaluated in the relaxation time approximation. 
This approximation assumes that collisions seek to return the system to the equilibrium 
configuration f(k), which is the configuration the system would have in the absence of an 
electric field. The rate of change of f(k) due to collisions is assumed to be proportional to the 
degree that f(k) is different from fọ(k): 


df W-K] 
(5) collisions E Tr (k) 68.5) 
2 
pE) = FE (8.6) 


The factor of two in fọ is due to the spin degeneracy. The above equation defines the transport 
relaxation time t,(k). A more detailed derivation can be found in Ziman (1960). Here an 
equation for t,(k) is derived and solved. Then the distribution function is 


f(k) = folk) — et (KE: V} f(k) (8.7) 


When the electric field is small, only a small amount of current flows. The system is only 
slightly out of equilibrium. The distribution function f (k) = h(k) + fi (k), where the change 
Ji(k) is small. It is only necessary to retain terms of O(E). Iterate the above equation, which 
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effectively replaces f by fọ on the right-hand side. Then f(k) is evaluated to O(E). 
Furthermore, it is assumed that the system is isotropic and f(k) = p(k) 


S (k) © folk) — et (K)E - V; folk) (8.8) 


E-kdfi(k) 


~ folk) — et,(k) —— m “de, (8.9) 


The electrical current density J is the product of the electron charge e, the electron’s density 
no, and the average velocity (v), which is obtained by averaging over the electron distribution: 


d'k Ak 
dk | 
Ny = ea (k) (8.11) 


The distribution function f(k) is normalized to give the electron density no. By using the result 
(8.9) for f(k), the term fọ gives an average (v) of zero, since no current flows when there is no 
electric field; as many electrons are going one way as another. The current is proportional to 
the second term, and it is proportional to the electric field: 


E EO 
J= p- ON »( e) (8.12) 


In a homogeneous, isotropic system, the current J flows in the direction of E. The quantity 
f(k) is independent of k direction. The only angular factors are v,(v,°E). The angular 
integrals will average this to v7E/3 in three dimensions. The conductivity o is the ratio of J to 
E: 


g aa Pk i(- HW) (8.13) 


ny t,(k)v dé, 


It is a positive quantity since df, /dé, is always negative. Equation (8.13) is the basis of all the 
calculations. There remains the important task of deriving a formula for the relaxation time 
T,(k). It is not just the time between scattering events, which is derived from the imaginary 
part of the retarded self-energy. This distinction is important, since it makes life difficult. The 
relaxation time in the Boltzmann equation is a special quantity. 

The impurities are assumed to be static, fixed, objects with a spherically symmetric 
potential. They have no internal excitations, so the electron scatters from them elastically. The 
impurity causes the electron in state k to scatter to k’, which has the same energy, so that 
|k| = |k’| and £, = gy. The net rate of scattering out of the state k is the rate of going from k 
to k’, which is proportional to f(k)[1 —/(k’)] minus the rate from k’ to k, which is 
proportional to f(k’)[1 — f(k)]: 


(2 _ LICK) = fol] _ ak 
(aa 7 T/(K) = mn: = (27 y 


x {Tae POE — FI Ter PAK -AEN (8.14) 


z O(E, — Ey) 
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where n; is the concentration of impurities. The quantity 7,,, is the 7matrix element for 
scattering from k to k’ which was defined in Sec. 4.1. It is symmetric in its indices 
Tkw = Ty,, which simplifies the above equation 


BE — &e Ta LF) — FY] (8.15) 


[f(k) — folk) PK 
ti(k) = 2mn; | (2n)° 


The integrand contains the factor f (k) — f(k’). The integral is evaluated by assuming the form 
in (8.9), which is written as 


f(k) = fy(k) +k: EC(k) (8.16) 
f(k’) = folk) +k ECK) (8.17) 
f(k) —f(K’) = (k — K’) EC(k) (8.18) 


Since |k| = |k’| then and (kK) = fo(k). The quantities f(k) and f(k’) differ only in the 
angular part of the second term. The angular part is treated as follows: define a coordinate 
system in which the 2 direction is k, so that 


k» Ê = cos 0 (8.19) 
k-k = cos (8.20) 
k’- È = cos 8 cos @' + sin 0 sin 6’ cos b (8.21) 


where the law of cosines is used in the last identity. The difference of the two distribution 
functions is now 


f(k) — f(k’) = kEC(k)[cos 8(1 — cos 6’) — sin 8 sin 8’ cos $] (8.22) 


The last term on the right, which contains the factor cos p, will vanish when doing the integral 
{ dd. There is no other pọ dependence in the integrand of (8.15), and the average of cos ọ is 
zero. The remaining term may be written as 


| do f(k) — f(K’)] = kEC(k) cos 6 Jaa. — cos 6’) (8.23) 
= [ f(k) —fo(’)] faoa — cos 0’) (8.24) 


The term f (k) — f(k) is factored from both sides of (8.15), which leaves the definition for the 
reciprocal of the relaxation time: 


l = 2nn [<4 ae — ¢,)|Ti | (1 — cos 0’) (8.25) 
t(k) J (Qn) RON l 
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The important factor in the integrand is (1 — cos 0’) = 1 — kk’ /k’. It makes the relaxation 
time in the Boltzmann equation different from the usual relaxation time t(k), which is the 
time between scattering events. The latter quantity is simply 


1 ark’ B 2 
1 = 2nn; lo" d(€; Ex) | Tax: | (8.26) 
= ann; ¥_(21 + 1) sin? (k) = v,n,0(k) (8.27) 
mk l 
o(k) = = >. (21 + 1) sin’ 8,(k) (8.28) 
] 


The usual inverse relaxation time is just the scattering cross section o(k) times the particle 
velocity v, times the impurity density n, An equivalent result may be obtained for the 
relaxation time t,(A) of the Boltzmann equation. The T matrix is expanded as in (4.47) and 
(4.52) for the case where |k| = |k’|, 


Tae = — = S(21 + 1)P,(cos O)e®® sin(5,(k)) (8.29) 
l 
l — Attn; ©. 2 
TO mk 2 I sin'[ (K) — 8; (k)] (8.30) 


where 9 is the angle between k and k’. The angular integrals in (8.25) are straightforward but 
cumbersome, so only the result for t, is just listed. The formula for 1/t, has a different 
combination of phase shifts than the formula for the cross section. 

The factor (1 — cos 6’) weights the amount of scattering of the electron by the impurity. 
Small-angle scattering, where cos @’ ~ 1, is relatively unimportant in contributing to 1/t,. 
These events do little to impede the flow of electrons and so contribute little to the resistivity. 
The factor (1 —cos6’) obviously favors large-angle scattering events, which are more 
important for the electrical resistivity. The relaxation time in the transport equation is not 
identical to the average scattering rate because there is an additional factor to weight the 
amount of scattering. 

An example of evaluating the conductivity in (8.13) is given for a free-electron metal. At 
low temperature the distribution function (8.6) becomes a delta function in energy which sets 
k = kp 


dfo ) 
—— } = 26(&,) (8.31) 
( dé, ' 
The angular integrals have already been done, so that 
2e 4n ent (kr) 
= 3 Ony vymkpt(kp) — F (8.32) 


where ny = k}./3n*. The relevant relaxation time is for electrons at the Fermi surface. Yet the 
conductivity is proportional to the density ng of all conduction electrons and not just to those 
at the Fermi surface. This surprising result is quite reasonable once the physics is understood. 
When the electric field is first imposed, the equation k = —eE shows that all electrons in the 
Fermi sea start accelerating equally. The Fermi sea is translationally shifted in wave vector 
space. The scattering tends to relax the Fermi distribution back to its undisturbed config- 
uration. As shown in Fig. 8.1, electrons in the leading edge of the displaced Fermi distribution 
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FIGURE 8.1 The circle represents the Fermi sea, which begins to move in response to an applied electric field. 
Steady state is maintained by relaxation back to other points on the Fermi surface. 


are scattered back to the rear regions. Only those electrons at the Fermi surface can scatter. 
The electrons well below the Fermi surface cannot elastically scatter, since all states with the 
same energy are already occupied with other electrons. Above the Fermi surface there are no 
thermally excited electrons. Only electrons at the Fermi surface are available to elastically 
scatter to other points on the Fermi surface. The lifetime only involves 1,(k;). 

The conductivity is relatively insensitive to temperature as long as the density of states of 
the metal is a smooth function of energy near the Fermi surface. The resistivity p = 1/o is the 
inverse of the conductivity, and is proportional to the concentration of impurities. This 
proportionality is experimentally verified. 

Since impurity scattering is elastic, it does not change the energy of the electron. As the 
current of electrons moves through the solid, each electron gains energy from the electric 
field. How does the electron lose this energy, if it only scatters elastically? The next few 
paragraphs will answer this question. 

Let v = cos 0, where @ is the angle between k and E. In order to keep the discussion 
simple, it is assumed that the distribution f(k) is isotropic in the absence of a field. When the 
field is present, the distribution function can be expanded in a Legendre series in v. The first 
few terms are 


SE =f, + Vlk) + Pa(v) falk) + ++ (8.33) 


where f,(k) is the isotropic part of the distribution while f,(k) is the /=1 part of the 
distribution. Note that f,(k) is not the equilibrium part of the distribution, which is fọ(k). The 
electrical current is determined by the distribution f. 

The conduction process can be viewed as having the following steps: 


1. For t < 0, E = 0 and the initial distribution is fọ. 

2. At t=0 the field E is switched on. It accelerates the particles and creates the 
distribution fp. 

3. The elastic scattering takes the particles from the p distribution f, to the s distribution 
f,. This step has a time constant T,. 

4. The isotropic distribution f, relaxes back to the equilibrium distribution fọ. This step 
has a different time constant. 


The energy relaxation occurs in the last step where f, is brought to equilibrium. The 
electrons can lose energy to their heat bath, usually phonons. This process has a very different 
time constant than t, and is usually much slower. The relaxation time 1, from elastic scattering 
determines the rate at which particles scatter out of the p distribution f, into other distributions 
such as f, and f4. The current is determined by q, since it gives the steady state amplitude of f. 
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The energy relaxation occurs elsewhere in the chain of events. The rate for energy relaxation 
tT, is calculated below. 


8.1.2. Kubo Formula: Approximate Solution 


The electrical conductivity can be calculated from the Kubo formula by using the 
technique described in (3.390)-(3.393). The correlation function is evaluated for nonzero 
temperatures and frequencies: 


1 pe 
n(io,) = — ai dre" (T(t) j0) (8.34) 


The retarded function 1,.,(@) is obtained by letting im, — œ + ið, and the dc conductivity is 
given by the limit of œ — 0: 


I 
o = — lim [Ee (8.35) 
o—>0 (A) 

This correlation function for o is evaluated for the same model system described in Sec. 
8.1.1. There is a free-particle system with a dilute concentration of simple scattering centers. 

Equation (8.34) is evaluated for the following Hamiltonian and current operator: 


H = 2 E Cio Cpo + V (8.36) 
v= E VODA (8.37) 
pa) = enS (8.38) 
P = È Chao Cpo (8.39) 
j= A 3 pCtoCoo (8.40) 


The impurities are at positions R;, and an average will be taken over the possible distributions 
of impurity positions. This averaging technique was described earlier in Sec. 4.1.4. 

The theoretical calculation is divided into two parts. The first part, in this section, is 
simply to reproduce the Boltzmann result which was derived above. How is a conductivity 
derived which is proportional to the relaxation time 1,? The derivation entails a summation 
over a set of vertex diagrams. The treatment is kept as introductory as possible, since it is one 
of the first summations over vertex diagrams. A formally exact solution to the correlation 
function due to Langer (1960) is presented in Sec. 8.1.3. 

The logical way to evaluate the correlation function (8.34) is as a power series in the 
concentration of impurities. Averages over the impurity density operators in (4.91) are 
expressed as a power series in the number of impurities N;: 


falis ++ ++ da) = (PG) )P(G2) - -+ PAG,)) 
= Nidxq + N783q,53,q + cee (8.41) 


At first sight it appears possible to evaluate (8.34) by just expanding the S matrix and 
collecting all terms proportional to N;, then N?, etc. A simple expansion in powers of N; does 
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not work, as is apparent from the desired answer. It has o œ t, œ 1/n;. The first term has the 
conductivity inversely proportional to n;, which is impossible to obtain as a simple series in N, 
except by summing a set of diagrams. The expected answer is only going to be obtained by 
summing a series of diagrams. 

The correlation function (8.34) is written in the interaction representation as 


T(io,,) = al dtel" (T S(B)e je - j) (8.42) 


The logical way to evaluate this expression is to expand the S matrix and consider each term. 
The first term has S = 1, and this correlation function is called mo: 


e2 
Ti (i@,,) = -7A m dīte®™" 2 pp (T,C} o(t)C, s(1)C}, y (0)Cpo(0)) 
pp'oo’ 


2 (8.43) 
e 


— 3m2v 


f dre®" > al dren GO(p, t)G(p, —T) 
0 p 0 


This expression is zero unless im, = 0, since the number operators ChoCoc are T-indepen- 
dent. The t-integral gives zero unless iw, = 0 and then it gives B. The term no gives a 
conductivity of zero. The result is not surprising, since it is the correlation function of 
noninteracting particles, and they have zero resistivity. Perhaps a better answer is the corre- 
lation function for the conductivity is infinity. The zero is sufficient to alert us that a 
nonsensical question was asked, and a nonsensical answer was obtained. A resistive system 
requires putting damping into the particle motion. 

The next logical step is to replace all GY by Y. The self-energy of the particles, from 
impurity scattering, is included in all the particle Green’s functions. Of course, & is obtained 
by summing a series of diagrams, which is Dyson’s equation. The self-energy X(p, ip,,) from 
impurity scattering was evaluated in Sec. 4.1 in the limit of low n,. It 1s a retarded function 
with real and imaginary parts, where the imaginary parts are due to the damping of the 
particle motion. The step of replacing 9 by Y does put in damping of the particle motion. 
The first correlation function which will be evaluated is shown in Fig. 8.2. It is a simple 
bubble diagram, with the smooth lines denoting Y and the two vertices having the vector 
vertex p. This correlation function is called n (ia): 


rlo) = 


E G(p, ip + iw)4(p, ip) (8.44) 
B 


The wiggly lines at the two ends of the bubble, which are connected to the vertices, represent 
the incoming frequency iw,. The first step is to evaluate the summation over Matsubara 
frequencies. 


v 
vv 


FIGURE 8.2 


Sec. 8.1 e Electron Scattering by Impurities 507 


The procedure for doing this summation was described in Sec. 3.5. The easiest way is to 
represent the interacting Green’s functions by the Lehmann representation 


Sio) == PEI. ip + io) ip) 


dé, E> l l 
—A(p, —=A(p, — . 
- (Foe 4P ape 4D 
dé, E2 np(E2) — np (E1) 
= (Fap e) [F240 e aee (8.46) 


The next step is to convert to a retarded function (im, —> œ + ið) and then to take the 
imaginary part: 


Im[S(o)] = —n E #1 A(p, e) |S dE 4(p, 65){np(€s) — np(€)}3(@ + & — £1) 
=— dae €2)A(P, & + ©){n; (£2) — np (£2 + ©)} (8.47) 


Im[n®(0)] = 


or |S “2 4(p, E,)A(p, &2 + @){n; (£2) — nr (€2 + ®)} 


The next step in the derivation (8.35) is to divide by œ and then to take the limit œ — 0. The 
important frequency dependence is in the last factor: 


_ I dnj-(€>) 
lim — {np(&) — nr (e + @)} = — ie (8.48) 
and the conductivity from this contribution is called o© 
e fdp de dnp(€) 
(0) _ 2 A? _ F 
o =3,2 |< $ zP z (p, »( e ) (8.49) 


In the last step, the limit v —> co changed the summation over p into a continuous integral. 
The right-hand side is positive since dnp/de is negative. Before discussing this result, it is 
useful to review the order of the steps in the derivation. They will be used in all evaluations of 
the Kubo formula: 


1. Do all summations over Matsubara frequencies ip,,. 

2. Analytically continue io —> œ + ið to get the retarded function 7,,,(0). 
3. Take the imaginary part Im[r,,,(@)]. 

4. Divide by œ, and then take the limit œ — 0. 


These steps cannot be taken out of order. Equation (8.49) for o has several interesting 
features. The factor dn; /de = —6(e) at zero temperature, which is rather convenient, since it 
serves to eliminate the integral over de. The sharp step in np(€) is in contrast to the 
momentum distribution (3.135), 


np = Z np(£)A(p, £) (8.50) 
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which no longer has a discontinuous step at p = kp since the impurity scattering causes a 
general smearing of this distribution. However, the energy distribution n;(€) is always a sharp 
function of £ at zero temperature regardless of the interactions. 
Another crucial feature of (8.49) is that the spectral function is squared, A(p, £)". That 

this is important may be shown by examining the limit as the impurity concentration n; 
vanishes. Since the self-energy is proportional to n; it will vanish in this limit. Define 
A = —Im(%), and then 

lim A(p, £) = li 2A = 2nd(e — é) (8.51) 

noo” able — E, — Re) + A P | 
The spectral function becomes a delta function when n; —> 0. This limiting behavior is 
reasonable, since in the absence of self-energy effects, the particles are all free, and the 
spectral function is indeed a delta function. The question at hand is what happens to 4? as 
n; — 0, since it appears to go as the square of a delta function. In fact, A* does diverge as 
n; — 0, which makes the conductivity diverge to infinity when n; — 0. A method is needed 
for handling this divergence. The answer is provided by considering the integrals 


©% doa 2A 

| eo a= (8.52) 
° do 2A 2 
| Sls 2 a] -4 (8.53) 


where œ = € — &, — Re(È). The first integral has the right behavior as A — 0, since it gives 
the same result as does A = 2A/(@? + A”) = 275(m). The second integral suggests the 
replacement 


2A V 2n5 
lim A? = lim ( ) — jim 222) (8.54) 
A->0 A->0 A>0 A 


which will give the right behavior as A —> 0. The replacement 4? —> 278(w)/A will be made 
in the limit as n; — 0. Furthermore, the quantity 2A is recognized as the inverse scattering 
time of electrons on the Fermi surface, 


| 

2A(ke, € = 0) = ——~ = —2 mÈ 8.55 
The conductivity formula may now be written as 
2e f dp 

o = | — eE, p 8.56 

0 | GO) (8.56) 


This equation looks like the nght answer for o since it seems to have exactly the same 
combination of factors as (8.13). But there is a very important difference between (8.56) and 
the Boltzmann result (8.13)—1in the relaxation time. The formula (8.56) has a relaxation time 
without the (1 — cos 6’) factor, since the relaxation time in (8.56) is from Im(X), which is just 
the average time between scattering events. The (1 —cos6’) factor was important for 
weighting the large-angle scattering processes, which were important for the resistivity. The 
preliminary answer (8.56) is not the Boltzmann result and has serious deficiencies. 

The above derivation has one achievement. It succeeds in deriving a term in o which is 
inversely proportional to n,;. The relaxation time t(p), although the wrong one for the 
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conductivity, at least has the virtue that it is inversely proportional to n;, which makes o 
also inversely proportional to n,. 

The calculation has not yet yielded the right result. More diagrams need to be evaluated. 
Among those remaining, there must be a subset which, when evaluated, will give the right 
answer. The S matrix is expanded for impurity scattering, and the remaining terms contain 
higher powers in the impurity interaction V At first it appears that higher powers in the 
impurity interaction must imply that the additional terms are higher powers in the impurity 
concentration n;. This conclusion is false, as is obvious from the answer. The final o is 
proportional to 1/n;, while our preliminary term o® is also proportional to 1/n;. The 
important correction terms in the S-matrix expansion must also yield terms in the conductivity 
which are proportional to 1/n,;. The S-matrix expansion is examined to find the terms which 
cause © to diverge as 1/n; when n; — 0, although these terms must come from higher terms 
in the S-matrix expansion. Higher-order terms in S can be proportional to 1/n; if they also 
contain higher powers of the spectral function A”. 

The correlation function o, contained Green’s functions Y which include all self- 
energy effects. The remaining diagrams are called vertex corrections. They are defined as 
diagrams in which the impurity scattering links the Green’s functions on both sides of the 
bubble. Some vertex diagrams are shown in Fig. 8.3(a). There is a single impurity with a 
varying number of scattering events from the electron line on either side of the bubble. If 
there were no scattering line connecting one side of the bubble, the diagram would be a self- 
energy term on the other side. A diagram in which the two electron lines, on both sides of the 
bubble, scatter from the same impurity cannot be a self-energy diagram of either one and so is 
called a vertex diagram. Figure 8.3(a) only shows vertex diagrams with a single impurity 
participating in the scattering. Vertex diagrams can occur with scattering from several 
impurities. These are equally important and are considered later. 


FIGURE 8.3 
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The sum of all the diagrams in Fig. 8.3(a) can be evaluated in a simple way and gives the 
correlation function: 


W) (ip, ip + id) = n;T yy (ip) Typ ip + io) (8.57) 
(1) . 2e? l I . . . J o 
T (iO) = 5 DG LPP YP, ip)A(P, ip + iw)F(p’, ip) 
3m*v" pp’ B “ip 


x Gp’, ip + ia) Wy (ip, ip + io) (8.58) 


Equation (8.58) is easy to prove once the rules are recalled for scattering from a single 
impurity: (1) total momentum is conserved, and (2) energy is not changed from ip or ip + io. 
The second rule says that all electron lines in one string, say the top, are at the same energy ip, 
while the ones on the bottom are at ip + im. The momentum conservation requires that the 
momentum transfer on the top electron line be p — p’, which is exactly the opposite of the 
other: p’ — p. The evaluation is easy, since the two sides decouple, as is illustrated sche- 
matically in Fig. 8.3(b). The effective interaction W®? is represented by a diamond in the 
figures, which is illustrated to the right of Fig. 8.3(a). The diamond is the total vertex scat- 
tering from a single impurity. 

Figure 8.3(c) shows the sum of correlation functions which have increasing numbers of 
diamonds in them. These contributions are called ladder diagrams. Summing the ladder 
diagrams achieves the objective of a relaxation time with the factor (1 —cos6’). It is 
important to realize that the sum of diagrams in Fig. 8.3(c) is not the only contribution with 
scattering from several impurity sites. An example of a nonladder diagram is shown in Fig. 
8.3(d). This contribution is not included in the series shown in Fig. 8.3(c). It is of order O(n?) 
and is neglected. The sum of ladder diagrams omits many terms. However, the omitted terms 
are not as important in the limit where n; > 0. 

The first two terms in the sum of ladder diagrams have already been derived; they are 
n®, and 2), in (8.44) and (8.58). The superscript denotes the number of ladders. The 
superscript (L) denotes sum of ladder diagrams. The series of terms in the ladder sum can be 
generated by representing them as a vector vertex function re: 


2e* 


3m2v 


ROO) = Yap Tp, ip, ip + io) AP, ip), ip + ia) (8.59) 
P ip 


] 
T®(p; ip, ip + io) = p + = TO (p; ip, ip + io) WO (ip, ip + io) 
p’ 


x G(p’, ip)G(p’, ip + iw) (8.60) 


Repeated iteration of (8.60) will generate the series of terms in the ladder summation shown 
in Fig. 8.3(c). The ladder summation will be evaluated, approximately, in order to obtain the 
factor of (1 — cos 6’) in the relaxation time. 

The correlation function is a function of ip and iœ in the combination ip and ip + ia. 
Define the quantity 


oo, 2e? a oo, 
Pip, ip + i@) = = 2 p ' T” (p; ip, ip + 10) H(p, ip) AP, ip + ia) 
p 


| (8.61) 
tin) = BÈ P(ip, ip + iœ) 
ip 
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The summation over Matsubara frequencies ip is evaluated, as usual, by examining the 
contour integral f dzn;(z)P(z, z + iw). The integrand has the poles of n;(z), which give the 
summation over ip, and also branch cuts along the two axes z = £ and z = e — ia, where £ is 
real. These cuts are shown in Fig. 8.4. The integral equals the contribution from the two 
branch cuts, where one has to subtract the parts above and below each cut: 


od 
nr (io) = -| << np(e)[PCe + ið, € + i@,) — P(e — id, € + iœ) 


+ P(e — i@,, € + ið) — P(e — iw,, € — id)] (8.62) 


The infinitesimal part +ið is unnecessary when there is a term with tio,. Next find the 
retarded function from the analytical continuation iw, —> œ + ið. A variable change 
e — £ + in the last two terms brings us to the point 


rE (o) = i s {[n-(é) — np (E + w)|P(e — ið, £ + @ + iò) 


— np(e)P(e + ið, e + © + ið) + np (£ + @)P(e — ið, e +œ — ið} (8.63) 


The next step is to take the imaginary part of this expression. Because of the factor of i in the 
denominator 27i, this step gives the real part of the integrand. The subsequent step is to take 
the limit that œ —> 0. The function P(e — ið, e + œ + ið) becomes P(e — ið, e + ið) at zero 
frequency, which is real. It is real because the function is symmetric in its arguments, 
P(Z,, z2) = P(z2, Z,), so that the complex conjugate of P(e — ið, € + ið) equals itself. The last 
two factors of P in the above equation are complex conjugates of each other. Taking the real 
part of the integrand removes their imaginary parts, and the real parts are equal. 

The limit œ — 0 again only involves [np(€) — np (£ + @)]/@. These steps brings us to a 
formal expression for the dc conductivity: 


imin® 
o=— lim ( M[Tret ) 
a— 0 63) 


(00) 
= B = (- a) {P(e — ið, e + ið) — Re[P(e + ið, € + id))]} 
There are only two functions which need to be found, P(e — ið, € + iò) and P(e + ið, € + ið). 
At zero temperature, where —dn,/de = (e), they need to be found only at e = 0. These two 
functions have quite different behavior and are obtained by different methods. Both are 
usually important, but the most singular is P(e — ið, e + ið), and this term leads to the factor 
of (1 — cos 0^) in the lifetime. 


FIGURE 8.4 Branch cuts in the contour integral. 
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Consider the definition of P in (8.61): 


P(e — ið, € + i) = TO (p, £ — ið, £ + 18)G(p, £ — i5)G(p, £ + iô) 


The product of Green’s functions is entirely real: 


FP, € — i0)F(p, € + 15) = Gagy(DE)Gre(P, €) = 


(2 + o2 + AZ 
_ A(p, €) 
= DA(p.6) (8.65) 
© = £ — bp — Re[X(p, £), A = —Im[2] 
(8.66) 


P(e — ið, € + ið) = p- Tp, e — ið, e + iô) 


e l dp Alp, £) 
3m? J (27) A(p, £) 
The combination G dyvGret is rigorously defined as the spectral function A(p, €) divided by 


24(p, £). There is no assumption that A* = A/2A. The same combination is found in the 
equation for the vertex function: 


3 
rO (p, £ — ið, € + ið) =p+| oor Tp’, £ — ið, £ + ið) 


x WHE — ið, e + 18)Gyay(P', £€)G;a(p', £) (8.67) 


The vector function I’ must point in the direction of p since that is the only vector in its 
function arguments. It is convenient to define an integral equation for the scalar function: 


py(p, £) = Fp, £ — ið, e + iô) (8.68) 
_ AWD, E) PP Lay 
y(p, €) = 1+ [52 7 IAE PP P yy p (E — ið, e + id)y(p’, £) (8.69) 


P(e — ið, € + ið) = 


5l] d°p A(p, €) p’y(p, £) (8.70) 


3m? J (27) A(p, e)” 


Equation (8.69) is a one-dimensional integral equation for the scalar function y(p, €), where p’ 
is the integration variable. The angular integrals dQ, just average the quantity p ° p’ We over 
angles to provide the kernel for the integral equation. The integral equation should not be 
difficult to solve with modern computers for realistic self-energies and 7-matrices. 

The Boltzmann result is obtained in the twin limits T — 0 (e — 0) and n; > 0, where 


A(p’, 0) > 2n0(G,,). Equation (8.69) then reduces to the integral equation 


dp 2nÒlEp) pp’ 


y(p, 6) =1+ jon TAG.) plow YO €) (8.71) 
A- -2 
(kr) = 1 + yk) — ~ (8.72) 
d°p' p'p’ I 
2A, = 2tn, /)f1—-_ P= l 
l NA; [ies a(S, p? [iT | T, (8 73) 
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which is easily solved to give 
A 
y(kr) = A. (8.74) 
1 


The factor (1 — p: p’/p*) = (1 — cos 89’) since |p'| = |p|. The solution y = A/A, is put into 
(8.65) to give 


e f[ ad A(p, 
P(e — ið, e + ið) -alo p ans (8.75) 


which gives the same conductivity as (8.13) when put into (8.64). The term P(e — ið, € + id) 
leads to the important contribution as T — 0 and n; —> 0. 

The other term P(e + ið, e + ið) should not be neglected. It is complex but not very 
singular in the limit where n; —> 0. As is evident from the definition (8.61) the singular parts 
should arise from the Green’s function product: 


l œ? — A* — 2iwA 
G E +i) = Gh ey = nmn sau o 8.76 
(p ) (Pp ) (© E iA)? (0? 4 A?) ( ) 
1 2A? 2iwA 
TD Ad 2\2 L 2\2 (8.77) 
o +A (œ +A yY (è +A’) 
A l3} , 


As n; — 0 then A* — A/A and the real part of this expression vanishes. The imaginary part 
becomes 6(@), which is also zero. The singular parts of this expression vanish as n; — 0. 
The vertex corrections to P(e + ið, e + ið) are not of order unity but of order O(n,). In this 
case, the vertex corrections may actually be a series of terms which are successively smaller, 
so that the vertex corrections may be obtained by just evaluating the first few. The situation is 
quite different than for P(e — ið, € + id), where one has to solve the vertex equation and sum 
all the ladder diagrams. 

One can evaluate P(e + ið, e + ið) by solving a vertex equation similar to (8.69). Define 
the scalar vertex y’ by Pp, e + ið, e + iô) = py'(p, £), and then 


d'p pp’ 2 2 
)=1tn] Toy (&) Y (P, €)G ilp, € 8.79 
y (p, 8) On? p pp E) Y (Ds €)Gret(P , £) (8.79) 
The vertex function y'(p, €) is complex, as is the vertex Top (£), and the product G..,(p’, €Y. 


There are two coupled equation for the real and imaginary parts. It may be obtained from the 
Ward identity (1950), 


I'(p, € + ið, € + iò) = p + mV È(p, € + id) (8.80) 


which is an exact identity between the exact vertex function F(p, € + ið, € + iò) and the exact 
retarded self-energy Ł(p, € + iô) from impurity scattering. This self-energy is not the one 
derived in Sec. 4.1, since that was only for scattering from a single impurity and only had 
unperturbed Green’s functions G© as the internal lines in the scattering equation. The exact 
self-energy is found from scattering from all numbers of impurities and using exact Green’s 
functions as internal lines of diagram—although this procedure must be done carefully in 
order not to count the same contribution twice. The Ward identity is very convenient, since it 
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permits the vertex function to be obtained from the self-energy function by a simple opera- 
tion. It will be proved in the next section. 

The present derivation has an approximate vertex function which contains only the 
ladder diagrams. In that case the Ward identity is still valid as long as the self-energy is found 
using only diagrams where an electron scatters from one impurity at a time. 

The Green’s function derivation of this result is certainly harder than the transport 
equation of Sec. 8.1.1. The one advantage of the Green’s function method is that there is no 
need to make the approximation that n; — 0. The n; — 0 limit is made implicitly in solving 
the Boltzmann equation when it is assumed the particles are plane waves except for occa- 
sional scattering events from isolated impurities. 

There are several lessons to be learned from this summation of ladder diagrams. The 
ladder diagrams, although they appear in a higher order of perturbation theory, do not lead to 
terms in the final answer which are smaller than lower-order terms. A term is not necessarily 
small if it occurs in a higher order of perturbation theory. It is this feature of vertex 
corrections which means they can never be dismissed without an investigation. They may be 
small, but one should always check. It would be nice to have some rules of thumb which 
would establish whether vertex corrections are important. The way to tell is to examine the 
scattering process which causes the vertex functions. If the two particle states, on each side of 
the bubble, can scatter quasielastically so that their relative energy changes little, then vertex 
corrections are large. The vertex correction is basically a potential divided by an energy 
denominator V/AE, where AEF is the change in energy of the two particles. If it is small, the 
small denominator will compensate for the small potential V so that vertex corrections 
become sizable. Repeated scatterings, as in a series of ladder diagrams, just cause additional 
powers of the factor (V/AE)". Vertex corrections are large when the scattering by the potential 
causes only a small change in the relative energy of the two particles. 


8.1.3. Ward Identities 


The evaluation of a two-particle correlation function, such as the Kubo formula for the 
conductivity, often requires an evaluation of a vertex function. The Ward (1950) identity is an 
exact relationship between the vertex and self-energy functions in the problem. As an 
example, two types of Ward identities permit the evaluation of the scalar vertex function 
I (p, ip) or the vector vertex function I (p, ip) which satisfy the equations 


. d? ’ fo. Jo. . . 

TO, ip) = 1+ | s z Tp, ip)2(', ip)? Wy (ip, ip) (8.81) 
. dp’ I I n2 . . 

F'(p, ip) = p + | SPE DIW. ib) Wi. i) (8.82) 


The Ward identity states that these two functions are given by 


I (p, ip) =1- on | (8.83) 
z=ip 
I(p, ip) = p+mvV,2(, ip) (8.84) 


An evaluation of the self-energy function X(p, ip) permits an easy evaluation of these two 
vertex functions. These relationships are proved below. 
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An important point regarding the Ward identities is that they are not useful for evaluating 
all vertex functions. An example is provided in the last section, where the Ward identities 
were useful for finding P(e + ið, e + ið) but not P(e — ið, e + ið). The Ward identities cannot 
be applied blindly; they must be used only when appropriate. These circumstances are 
delineated after the identities are proved. 

The Ward identities for impurity scattering were derived by Langer (1961). The similar 
theorems for the electron-phonon interaction were derived by Engelsberg and Schrieffer 
(1963). We shall prove the result for the ladder diagrams obtained by scattering from a single 
impurity. In this case, the self-energy diagram is that for scattering from a single impurity: 


3 


. d'p 
X(p, ip) = n| Vop + Ps Š Vip Vyp , ip) 
Ëp'Ëp ” A _ 
~ (2n)* V op’ Vy 'p” Vyp iP ` ip)G(p Y ip) + oe (8.85) 


An important condition is that the Green’s functions in this self-energy diagram are those 
calculated with the self-energy Y = 1/[ip — §, — X]. Equation (8.85) is a self-consistent 
equation for the self-energy È, since it depends functionally on itself. Unfortunately, the Ward 
identities do not let one avoid solving an integral equation. Instead, one integral equation is 
exchanged for another. In this sense the Ward identities are not very useful in practice. 
Rather than prove the two separate identities (8.83) and (8.84), a general theorem is 
proved for which these are two limiting cases. The general theorem is obtained by subtrac- 
ting the expressions (8.85) for 2(p,ip)=2X(p) by the same result for 


X(p +q, ip + iw) = X(p + q): 


d 
T +4) — EP) = n | o EV lI +D- P 


Ëp Æ 
+n (EP Pid Poy yy [G(r +9902. +9 -FPP 


(2n)° 
Ëp Ëp d 
+ r PP: PiP2 Vp-p; Vosp 
x [pi +) p + DID; +4) -FP P) p) + °°: (8.86) 


By purely algebraic manipulations, this series can be shown to be identical to 


Ëp 
Zip +4) — EO) =m | Fs 


x [(p, + q) 7 X(p)) + Sp +4 ~ Sp, _ iœ] (8.87) 


3 Top, P)Tp i+q, p+ CP + iw)[F(p, + Qa) 


=n [52 Š Top, (iP)Tp +4,p+q(ip + OZP +a) -Fp (8.88) 


This rather startling result may be demonstrated term by term. The first nonvanishing term has 
the integrand 


Vip, Vp pl + 9) — FO) = Vip, ppf + DIP) 
x [X(p) + q) — X(p;) + Gp ta ~ Sp, ~ io] (8.89) 
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which is just the vertex diagram shown in Fig. 8.5(a). The next term in the series (8.86) has 
the integrand 


Vow, “pp. “ppl EP + VIP2 + a) — FPF) (8.90) 
The Green’s function factors in brackets may be rearranged into 
GD, + Q)GP2 + 9) — FPI) = FP, + Dp +4) — Apr)! 
— FPZ p: + 9) — FP1)I (8.91) 


The first bracket on the right is the first diagram in Fig. 8.5(b). The factor V,, Vp p,4(P1 + 9) 
is just the multiple scattering from the impurity by the electron on the lower line. Similarly, 
the second term in (8.91) corresponds to the second diagram in Fig. 8.5(b), where the factor 
Voip, Vp 4(P2) i is the multiple scattering of the top electron line. These terms are just the first 
in the series which generates the 7-matrices. The further terms in (8.86) provide the remaining 
terms. In this manner, one can establish the validity of (8.88). 

Equation (8.87) shows the quantity 


obeys the vertex equation 


A(p, p + q) = Cp+q ~ SP — 10) 


+m |e n Ipp, + DT op,Toraneqh Orit (8.93) 
The two equations (8.92) and (8.93) provide the most general type of Ward identity. They are 
useful, since any equation which can be cast into the form of (8.93) has the solution (8.92). 
Langer (1961) and Engelsberg and Schrieffer (1963) show that this equation is related to the 
equation of continuity V +j + p = 0. 

The first Ward identity (8.83) is obtained by taking the limit q = 0 and then dividing Eq. 
(8.93) by —iœ with the result 


A(p, ip, ip + iw d? 
AQ. ip ip tio) _ | | PL S(p), ip) 9p, ip + ia) 


—10) (27 y 

-n APL ip, ip + io) 
x Typ, (ip)Tp p(ip + io) (8.94) 
The quantity I (p, ip) in (8.81) obeys the same equation as A/(—iw) in the limit iœ —> 0 so 


they are equal. From (8.92) one has the solution 


, A(p, ip, i j 0X(p, Z 
T(p, ip) = lim [Apipi _,_/2@.4 (8.95) 
io—>0 —10@ oZ Z=ip 


which proves the Ward identity (8.83). 
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The other Ward identity is found as the limit im = 0, followed by letting q — 0. The 
latter limit is taken slowly, so that one can retain terms proportional to q. In this limit Eqns. 


(8.92) and (8.93) become 


lim A = + +q°V,2(p, ip) (8.96) 


q—>0 


e d P 
= La +n; [2 3 Gp, ip)\A(p, +q, ip)T, PP) Ty, +4, ptqA(P1;P1 + q) (8.97) 
The vertex function A is proportional to q, so define the vector vertex function by the limit 
(iœ = 0): 


l 
lim A = — q: I '(p, ip) (8.98) 
q—0 m 


Then the preceding two equations can be expressed in terms of this vector vertex function: 


F(p, ip) = p + mV ÈQ., ip) (8.99) 


dp 
I(p, ip) =p + mn [oes 
(p. ip) =p (ony: 
This equation is the same as (8.82), which proves the other Ward identity (8.84). Both are 
now understood to be limiting cases of the general result (8.93). The Ward identities are 
useful anytime one can cast the vertex equation into the form (8.93). 

The factor [1 — 02/02] is recognized as the inverse of the renormalization Z defined 
earlier and discussed, for example, in Sec. 5.8.1. This quantity is sometimes called the 
effective charge. Similarly, the vector vertex is one of the factors which give the effective 
mass of the particle. The Ward identities relate the vertex corrections to a change in the 
effective charge and mass of the particle, which is why they are related to the equation of 
continuity. 


5 (Pı. iP) |Top, GP TO, ip) (8.100) 


8.2. MOBILITY OF FRÖHLICH POLARONS 


The Fröhlich Hamiltonian between electrons and Einstein phonons (@) = ©zọ) is 


H = Z pC pol po + È daa + 2G Chirac Cyo(aq + 414) 
(8.101) 
PETSA € OP 
«v 2mp P 2m, 
e/m \'7?(1 1 
=i aa) E mee 


It was discussed in Sec. 7.1, where several important quantities were derived, such as the 
effective mass m* and the ground state energy Ep. 

The effective mass m* of a particle can be measured by cyclotron resonance. Such 
experiments have been done for polarons (Hodby, 1972). The effective mass m* is a function 
of the band mass mg, and the polar coupling constant «. A separate measurement of the two 
dielectric functions £ and £x, as well as @,9, permits a determination of the band mass mg, 
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and « from m*. This analysis takes a theory of the polaron mass m*(mg, œ), which was 
provided in Sec. 7.1. Another way to check the theory is to measure the mobility of electrons 
in insulators. The mobility also depends upon mp and «. 

A typical experimental result for the mobility is shown in Fig. 8.6. It shows the Hall 
mobility of CdTe measured by Segall et al. (1963). The steep rise around 200K is due to 
optical mode scattering. At lower temperatures the mobility saturates because of the scattering 
from impurities in the crystals. Impurity scattering varies from sample to sample, depending 
on the concentration and type of impurity. 

The average value of the current operator is the particle density nj times the charge e 
times the average velocity (v). The average velocity (v) is proportional to the applied electric 
field F, and the constant of proportionality is the mobility, where e > 0 so the electron charge 


1S —e: 


(8.103) 
(8.104) 


HALL MOBILITY (cm*/volt-sec) 
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FIGURE 8.6 The temperature dependence of the electron mobility of several samples of n-type CdTe. Different 
samples have different kinds and concentrations of defects or impurities. The solid line is the theoretical mobility 
from optical mode scattering, including the temperature dependence of the static dielectric constant. The dashed line 
neglects this change. Source: Segall et al. (1963) (used with permission). 
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The mobility u is the average velocity of each electron per unit applied electric field. Of 
course, it is strictly defined in the limit of vanishing electric field. Since the electrical 
conductivity o = nje*t,/m = —epny is the ratio of the current J to the field F, then 
u = —et,/m. 

The theories of the electron mobility in insulating materials, such as alkali halides and 
II-VI semiconductors, treat it as a property of a single electron. The electron lifetime is 
calculated for the scattering from impurities and by acoustical and optical phonons. The 
electron—electron interactions can be ignored in the limit where the concentration of electrons 
is very low. Spin is also unimportant, and the spin index is omitted in this section. 

There are just as many different ways to calculate the polaron mobility as there are to 
calculate the effective mass m* or ground state energy Ey. Each theoretical technique was 
applied to the mobility as well as to the other quantities. As summarized by Langreth (1967), 
these various methods usually agree in the limit of weak coupling (a <1) and low 
temperature kT < iW. In this case 


. E eTo — 

lim u = m, = (8.105) 
l aN o N =—— (8.106) 
to 0 0 T ebo% — | l 


The lifetime tọ is the result obtained in (7.46) as 1/t) = —2 Im[=“(p, ¢)] in the limit where 
p — 0 and £ — 0, where ED is the one-phonon self-energy. This limit is appropriate, since at 
very low temperatures the electrons are in states within kgT «< h@, of the bottom of the band. 
These low-energy particles cannot emit phonons, since this event is prevented by energy 
conservation. They can only absorb them, and the rate of absorption is proportional to the 
thermal average density of phonons Nọ. The factor Nọ makes the mobility increase expo- 
nentially with decreasing temperature, since the electron scattering becomes less likely as the 
number density of phonons declines. The exponential increase in the mobility is evident in the 
experimental data of Fig. 8.6. The behavior of large polarons is opposite to that of small 
polarons, whose mobility increases with increasing temperature. 

One feature of the mobility formula (8.105) is that it is proportional to the inverse of a. 
Our starting point for the theoretical calculation is again the Kubo formula, which will be 
evaluated for using electron—phonon interactions. The expansion of the S matrix for this 
potential will generate a series in the parameter «. To obtain a leading term in the inverse 
power of « requires the summation of a subset of diagrams. The situation is similar to the 
mobility from impurity scattering, where diagrams were summed to get the conductivity 
inversely proportional to the impurity concentration n;. There are important differences 
between the lifetime from opical phonon scattering and that of impurity scattering. This 
conclusion is evident from the result presented in (8.105). Here the relaxation time is not 
calculated with the factor of (1 — cos 0^) in the angular average. The polaron mobility is 
calculated in a different way than the scattering from impurities. Actually it is calculated in 
the same way, but a different result is obtained. This difference arises from the inelastic nature 
of the polaron scattering as first shown by Howarth and Sondheimer (1953). 

Langreth and Kadanoff (1964) showed that the polaron mobility u is a power series in «, 
with the leading term in (8.105) of order 1/«a: 


p= + aga? + aya! foe (8.107) 
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They tried to calculate the coefficient a_,, apin this series in the limit of low temperature. This 
objective is simple but has slippery aspects (Mahan, 1966). It is a subtle procedure to examine 
each term in the S-matrix expansion and to determine its leading term in «. The situation is 
similar to that for impurity scattering. Each term in the series (8.107) is obtained by summing 
subsets of diagrams. 

The first term of Langreth and Kadanoff (a_,) is quite simple, and it is probably 
worthwhile to state it in advance. Define Up as the result (8.105) for the limits x —> 0, T — 0. 
They found 

u 


X 
— = 1- -+ O(a’) (8.108) 
Ho 6 


They observed that o/u is precisely the expansion given by the equations 


n=- (8.109) 
m 
T = 1+0) (8.110) 
To 
M =1+Ž+0) (8.111) 
7 6 


The ballistic formula u = —et/m* supports the quasiparticle picture that the particle acts as if 
it has an effective mass m* and lifetime t. They speculate that the quasiparticle picture would 
be valid for all values of « and that the inclusion of all terms in « would just reproduce the 
product series of m* and T. 

Before the derivation of the electron mobility, it is necessary to derive some single- 
particle properties. These will be needed in the limit of zero temperature. All terms are 
dropped of order Nọ compared to unity. Many of these single-particle properties were derived 
in Sec. 7.1. The first self-energy term, proportional to «, is the one-phonon result in (7.15): 


3/2 ; 1/2 
Re[X(p, w)] = ~a sin! | —_#_ (8.112) 
Je (o-ote, 


At zero temperature, this self-energy is evaluated at small £, and small œ, 
Re[=(p, w)] = —alo +40 — 48, + Ola’, £3, we,)] (8.113) 


This expansion permits a quick derivation of the effective mass m* and the renormalization 
coefficient Z. The zero subscript means p = 0, œ = 0: 


IEN! l 0 
Z=(1-—]) = ~1—=+ Ox? 8.114 
0 ( xa), | + «/2 z t OW) (8.114) 
mB) = Oy IT sity oo 8.115 
Cr a(i) gT Om) (8-115) 


Another important quantity is the lifetime t which is defined as 


l 
qp 7 ZON-2 IME, EID (8.116) 
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where Z(p) is the renormalization coefficient evaluated at œ = E,, which is the ground state 
energy. The ground state energy is only needed to order «, which is the simple result 
E, = —aMy + p*/2m* + OQ’). 

Equation (8.116) shows that the renormalization coefficient Z(p) enters into the defini- 
tion of the lifetime. The argument for this is as follows. The spectral function is defined as 


A(p, ©) = —2 Im[G,e(P, o)] (8.117) 
_ PIO (8.118) 
{o — €, — Re[X(p, œ) + mÈ, o)l} 


A suitable definition of t(p) is obtained by examining this limit more carefully when Im(2) is 
small but not infinitesimal. In the vicinity of the peak œ ~ E, of the spectral function the term 
in the denominator is 


o — £ — Re[X(p, @)] ~ @ — e, — Re[X(p, E,)| — (@ — E) ° =e > 
x —E 
z~ (œ — B| —? =e | = Oe (8.119) 
so that the spectral function is approximately 
—2 Im[X(p, œ)] 
9 = — Cn 8. 
APO = oo E,y /Z(py + Im) (3-120) 
1/1(p) 

= Z(p)| ——— 5s 5 8.121 

p) a -Ef + aa) 6-121 


The last step used the definition (8.116) of the relaxation time. The relaxation time t(p) is 
treated as a function of p but not œ. This form for the spectral function is used in the Green’s 
function (Problem 6 in Chapter 3), 


Gap.) =O0| Seay, 0) (8.122) 
= —iZ(p)@(t) exp[—itE, — t/(21)] (8.123) 


The relaxation time is defined from the decay of the Green’s function. It has the desired form, 
with the relaxation time t(p) determining the decay of the excitation. 

There is another way to understand the factor Z in the definition (8.116) of t(p). The 
quantity —2 Im(2) is the rate of decay of a state (p, œ). The factor Z is the fraction of the 
quasiparticle strength at the value (p, œ). The rest of the quasiparticle strength is usually 
dispersed throughout the spectrum. 
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The one-phonon self-energy is evaluated for the quasiparticle lifetime. The imaginary 
self-energy is evaluated at the quasiparticle energy œ = Ey = —a@ and is multiplied by the 
factor of Z given in (8.115). The imaginary self-energy is calculated from the expression 


dq M2 

—2 Im[Ep, ©)] = 27 — 8.124 
Eoo) = 2" | (8.124) 

x [No Slo + M — Eq+p) + (No + 1)8(@ — Wo — Equip)! 

3/2 JO FO + /E 
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Expanding for small @ < @ and small €, gives 
N0 
—2 Im[X(0, = ———— 7X 24N, — 1 a. 8.126 
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— 2 Im[X(0, œ = —«0p)] = 2a] 1 + ; + o(a) (8.127) 
l l Oo Oo 
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l 
=—[1 + O(a )] (8.128) 
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The first correction terms in « from Z = 1 — a/2 and Im[=] = 1 + «/2 cancel to order O(a). 
The one-phonon term provides no correction term to t of O(a). 

The electron mobility is evaluated be the same method which was used for impurity 
scattering. Starting from the Kubo formula for the electrical conductivity, the first important 
diagram is the bubble with interacting Green’s functions shown in Fig. 8.7(a). The solid lines 
are total Green’s functions &(p, ip,,) which include the self-energies 


B 
n(i@) = -| dre!" (T jį) j0) (8.129) 
0 
2e f dp 51 . 
Om) = 2 
Oio) = sy | PE GD. iP ID, Pn + i0) (8.130) 


The solid lines are total Green’s functions Y(p, ip,,) which include the self-energies. The 
evaluation of this term is identical to that used for the same bubble diagram for impurity 
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scattering. The contribution to the Kubo formula contains the two Green’s functions as in Eq. 
(8.44), which are evaluated by the standard series of steps to yield Eq. (8.49): 


2 dP 


In the present application, the self-energies in the Green’s functions are evaluated from the 
Fröhlich Hamiltonian rather than impurity scattering. In a more realistic model of a solid, both 
self-energy expressions should be included: from phonons and impurity scattering. 

An approximate evaluation of this contribution to the conductivity is obtained in the 
limit where « < 1. The electron distribution is assumed to be Maxwell—Boltzmann: 


gue) _ "o (2B) -e 
ngle) © eh) = —({——] e™ 8.132 
P(e) 2 (=) (8.132) 
where np(£) is the energy distribution for each spin state, while nọ is the total concentration 
for both spin states. This term in the conductivity is divided by —eng to get the corresponding 
term in the mobility of each electron: 


3/2 
p” = -— (= ) | s, p |E A(p, €} eP (8.133) 


Using the expression (8.121) for the spectral function, the square of the spectral function is 


(Z/ 


2 
Ap, ey = {(e — E} + (2t) “VY 


~ AntZ°&(e — E,) (8.134) 


The electron lifetime t is inversely proportional to «, so that the mobility u® is inversely 
proportional to «. The preceding integral is evaluated in the limit where the temperature 
T — 0. The integral is easy if it is assumed at low T that Z(p) and t(p) are evaluated at p = 0 


m* eT 
u® = -erz = -—(1 — Ža + O(a’) (8.135) 
B B 


where Z(0) = 1 — «/2 and m*/mg = 1 + «/6. The result (8.135) is the contribution from the 
simple bubble diagram of Fig. 8.8(a). This result does not resemble (8.108). The differences 
disappear when higher-order diagrams are considered, such as those in Fig. 8.7. 

Other contributions to u can be derived from the other diagrams, which are the vertex 
corrections. In the limits T — 0 and « < 1 the vertex corrections do not contribute to the 
mobility a term which goes as O(1/a). The simple bubble result is the final answer at low 
temperature and weak coupling. This conclusion is quite different from the situation for 
impurity scattering. There a series of ladder diagrams was summed in order to derive the final 
answer, and each ladder diagram gave a term which was the same inverse power of the 
coupling constant as the impurity concentration n;. The vertex diagrams for optical phonon 
scattering are less important because of the inelastic nature of the phonon scattering. Mahan 
(1966) showed that the two-phonon ladder diagrams in Fig. 8.7(c) provide the largest vertex 
corrections, which are of O(T/@p). 
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8.3. ELECTRON-PHONON RELAXATION TIMES 


This section is concerned with the calculation of electron relaxation times in semi- 
conductors from the scattering by acoustical phonons. The prior sections have introduced two 
different relaxation times for the electron in a solid. One of these is the average time between 
scattering events and is denoted as t(k). The mean-free-path (mfp) for this relaxation time is 
denoted as /(k) = v, t(k). The energy bands in the solid are assumed to be isotropic, so that 
the relaxation time and mfp depend only upon the magnitude of the wave vector. The other 
relaxation time t, enters into the electrical conductivity (o = nge*t,/m*). The equivalent 
quantity, before averaging over wave vector, is called the momentum relaxation time and is 
denoted with the subscript “t” since it is used in transport of current. The equivalent mfp is 
l(k) = 0,1,(k). 

To further confuse the topic, the transport lifetimes t, for scattering from impurities had 
a factor of (1 — cos 6’) in the integrand of the scattering integral. The scattering from optical 
phonons did not have such a factor. The difference is that at low temperature, impurity 
scattering is elastic, while optical phonon is highly inelastic. Vertex corrections are relatively 
unimportant for inelastic scattering. 

In writing the Boltzmann equation, the last term in the time development is the rate of 
change of the distribution function from collisions. The discussion of the lifetime from 
impurity scattering in Sec. 8.1.1 started with this collision term. Here the similar collision 
term is presented from the scattering due to the electron-phonon interaction: 


YSP) _ on _ 
CD) =F [ae nar vet re +a) 


x [(Nq + 1L)8(Ep — Epp g — A@q) + Ngd(Ep — Ep+q + ÉO) 


—f(p + q)[! “ply, + 1)d(E, ~ Ep+q + hOg) + NgC(Ep ~ Ep+q 7 ÁO) 
(8.136) 


The integrand has four terms. The first two correspond to an electron initially in state p 
scattering to p + q by either the emission (N, + 1) or absorption (Nq) of a phonon. This term 
is multiplied by the occupation factors f(p)[1 — f (p + q)] which ensures that the initial state p 
is occupied and the final state is empty. The other two terms correspond to processes whereby 
electrons initially in p + q scatter back into the state p by either phonon emission or 
absorption. The back scattering has the occupation factors of f(p + q)[1 —/(p)]. The above 
expression vanishes when the system is in thermal equilibrium and f(p) =n-(§)), 
Nq = q` ng(Oq). 

An immediate simplification of the above formula is attained by grouping together the 
terms with the same delta function for energy conservation 


EP), aem 
ðt jap (27)? 


x [5(Ep — Ep+q — 2Oq){f (PIN, + 1—f(P + q)] — Ngf(p + g)} 
+ O(€) — Ep+q + “+ hop (PIN, +f + q)] —f + QIN, + 1} (8.137) 


The above expression is quite general. 
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The terms which multiply f(p)are those which define the relaxation time 1/t,(p). 
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The subscript “p” denotes scattering by phonons. In Eq. (8.138) the relaxation time from 
phonon scattering is all of the terms which multiply f(p) in (8.137). It is identical to the 
expression obtained from the imaginary part of the electron self-energy, as calculated in the 
one-phonon approximation. The occupation factors f(p+q), Nq are represented by their 
values in thermal equilibrium. The matrix element is from deformation potential interactions. 

The transport lifetime t,, from the electron scattering by phonons is defined in (8.140). It 
has an additional factor of (—q + k/k?) in the integrand. If the scattering were elastic, then this 
factor equals (1 — cos 0°). The difference between the impurity scattering times 1,, and 1,,, is 
the factor of [1 —cos(®)] in the integrand of 1/t,, where O is the scattering angle 
ke °. D = cos(8). 


To lam (ns Tiel 8k — &) (8.142) 
5 = “mr p (27 y | Tp |? [1 — k + pl8(e, ~~ E,) (8.143) 


The identical factor is used to calculate the momentum relaxation (t,,) from scattering by 
phonons. If p = k + q then the angular factor [1 — cos(8)] ~ —q-k/ é The identity is exact 
ifp = k. Using this factor of [1 — cos 0] in the impurity scattering is rigorously correct. Using 
this factor in the scattering by phonons is not rigorously correct. The difference is that 
the scattering by phonons is inelastic. Instead, the correct result is found by solving the 
Boltzmann equation, which is done in Sec. 8.4. However, the above approximation 1s actually 
quite good, and therefore is useful. The lifetimes for scattering by phonons are evaluated 
below. 

Besides these two relaxation times, there are several others which are occasionally 
useful. The most important is the scattering by electron—electron interactions in a metal. The 
formulas for this case are derived in Chapter 11 in the discussion of Fermi liquid theory. Two 
others are the relaxation times for temperature, and the relaxation times for energy. These two 
are discussed below. 

The relaxation time for energy determines the rate at which the electron loses or gains 
energy from the scattering. Since impurity scattering is elastic, the electron does not change 
its energy. There is no contribution to energy relaxation from impurity scattering. However, 
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in the scattering by phonons the electron changes its energy by Ek+q — Ek = th@,. One 
process gains energy (+,) while the other loses energy (—hw,). There is a net gain or loss 
of energy if one process dominates over the other. For example, if an energetic electron 
is injected into the solid, it will gradually lose energy until it equilibrates thermally. The 
process of coming to thermal equilibrium requires that it emit more phonons than it 
absorbs. 

The rate of energy relaxation is calculated by starting from (8.138) and inserting the 
energy change & 4,4 — Ex into the integrand 


dE 2 
(a) =" =-2 (<5 Fis IM Pho 5(~ aq = RING + 1 = Mele) 
— O(€% — Ek+q + AM)IN, + MrlEx+q)]} (8.144) 


Converting this expression to a relaxation time (t,,) and mfp (/,,) is done below. 

The evaluation of the relaxation times for the electron-phonon systems is done for 
metals and semiconductors. In metals, the evaluation assumes the existence of a Fermi 
surface. The electron-phonon scattering affects those few electrons within about kgT of the 
Fermi surface. In semiconductors, the interesting effects are for a single electron in the band, 
and the reference energy is the band edge. These two cases are different and are treated 
separately. 


8.3.1. Metals 


In metals the derivation is simplified by introducing the McMillan function denoted as 
x?F(œ). In this function are collected all of the complicated parts of the phonon dispersion, 
and the matrix element (see Sec. 7.4) 


x F(E, ©) = [<4 FSCO — OSCE — &4q) (8.145) 
we F(E, œ) = |< 5 \M,| *(- 1) S(O — @,)5(E — Ek+q) (8.146) 
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=F | MOF o alo) + nel ~ 60 
+ F(E; + ©, @)[ng(@) + np(@ + &)]} (8.147) 


The dimensionless functions aF (E, œ) depends upon the variable E. However, this 
dependence is similar to the variation in the density of states at the Fermi surface. The 
variation with Æ is usually smooth on the energy scale of the Debye energy, which is all that is 
relevant for interactions with phonons. The important variation is 
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The second term is usually smaller than the first by a factor of ha/W, where hq is a phonon 
energy while W is an electronic bandwidth. The usual approximation is to neglect the 
dependence upon the factor of E The above expression is rewritten as 


Op 
a = =| doc F(o)[2ng(o) +n;-(@ — £) + np (0 + €;)] (8.149) 
Tole) (k) h 0 
High temperature is defined as T greater than the Debye temperature. Most solids have 
Debye temperatures less than room temperature, so that T = 300 K is a high temperature. In 
this limit, the largest term in the bracket is np(@) © kpT/ho and the lifetimes have the simple 
expression 


í 2rÀkgT (8.150) 
T 
P 
h 
— = 2nh,kgT (8.151) 
Tot 
© dw 
Aw =2 | T MF (@) (8.152) 


The inverse lifetime is proportional to temperature, and the constant of proportionality is 
the dimensionless electron-phonon coupling constant à}. The resistivity of metals 
[p = m/(e*not,)] at high temperature is linear in T, and the slope is given by A, A 
measurement of p(T), gives A,, which can then be used to predict the transition temperature 
for superconductivity in that metal. This process works quite well. 

The equation for energy relaxation is 


(=) = 2nh | doo? Foyt —n(&& — @) — np (£; + )] 
pe 0 
~ [1 — 2n-(e,)|AnA(w’) (8.153) 
Maw’) = 2 dowr (œ) (8.154) 
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The frequency integrals go from zero to the Debye frequency ©p = kg@p/h. The occupation 
numbers were expanded n;(€, +œ) © np(e;) + on. The first derivative terms cancel and 
the second derivative terms are small and are neglected. The factor [1 — 2;(¢,)] is one for 
electrons and minus one for holes. The energy relaxation always takes the particle to the 
chemical potential. 

The energy relaxation (dE/dt) is governed by the quantity 4(w7). It is a single function, 
although the notation gives the impression that it is a product of two functions. The quantity 
hd(w*) has the units of Watts. In this case there is no obvious lifetime Te nor mfp (l,e). The 
phonons give a constant value to the energy relaxation as long as the electron’s energy 
E=& — H > hp. 


8.3.2. Semiconductors 


The relaxation time for a semiconductor is calculated assuming that there is only one 
electron in the band. Most semiconductors have electrons or hole in one or several band 
minimum. In order to keep the discussion simple, the present calculation will be done 
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assuming there is a single conduction band at the center of the Brillouin zone. This situation 
applies to GaAs and other III-V and II-VI semiconductors. The phonon wave vectors are 
rather small. An electron with energy €, ~ k,T has a small wave vector k. If it emits or 
absorbs a phonon and goes to energy & ,, + f@,, then |k + q| is a small wave vector. The 
consequence is that g is also a small wave vector. At small wave vectors, it is a good 
approximation to represent acoustical phonons by the Debye model (œ, = c,q) and optical 
phonons by an Einstein model (@, = po). The results for acoustical phonons are derived here. 
The case of optical phonons is assigned as problems, although the derivation is similar to that 
for Frohlich polarons in the last section. 

The wave vector integrals can be done analytically since the Debye approximation is 
accurate. Define 


dP 
CFA (ko) = R |M, (Ex — Ek+q £AW,)O(M — Oq) (8.155) 
D?m* , 
= Beh pk” lo,<o<a, (8.156) 
O, = 2c,(k + k,), ©; = 2c,(+k, — k) (8.157) 
k, = m*c,/nh (8.158) 


The lower limit œ; can be set equal to zero since the factor of œ? makes this contribution 
negligible. The lifetime is 


1 
-zo 2n | det? PC, oingo) + 1] + Fk, w)ng(@)} 
p 
There are two interesting limits to this expression. The first is at zero temperature where the 
phonon occupation numbers ng(œ) = 0. The particle is assumed to be energetic so that 
k > k,. In this case the answer is 


1 
TE = Vp ek (8.159) 
P 
4 D’m* 
P= (8.160) 


The lifetime depends upon the value of kinetic energy. The second case is at high temperature, 
where ng ~ kzT/hw so the result is proportional to temperature. In this case the natural 
quantity is the mfp [/,(k) = v,1,(&)] 


1 Uk 
=£ 8.161 
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Another mfp for the electron is called /,. The symbol ¢ denotes the phase of the electron. The 
value of /, is the distance over which the electron travels before it breaks its phase coherence. 
For electrons in a pure semiconductor, where there are no electron—electron interactions, then 
lẹ = 1,. The distance for phase coherence is given by the mfp for scattering by phonons. The 
phonon can carry away an arbitrary amount of phase, so such scattering does change the 
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phase of the electron in a random fashion. In contrast, the scattering by impurities does not 
change the phase coherence. Impurity scattering changes the phase of the electron, but it 
changes the phase of each electron by the same amount. So coherence is maintained in 
scattering by impurities, while it is not maintained in the scattering by phonons. The impu- 
rities are just part of the one-electron potential which guides the electron as it wanders 
through the crystal. Electron—electron interactions also break phase coherence. 

A calculation similar to finding J, can be done for the momentum relaxation 
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The results are at low temperature 
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and at high temperature 
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The results are the same at high temperatures for the transport and regular scattering mfp: 


l = /,. This equality is expected for the following reason. At high temperatures, the wave 
vector dependence of the interaction is effectively 


\M,\° _D¢ _ D? 
AO, 2pm, 2pc? 


(8.166) 


The interaction is independent of wave vector, so the scattering is isotropic. The angular 
factor of cos(9) averages to zero for isotropic scattering, and the two mfp’s are identical. 
Some numerical results are presented in Table 8.1 for the high-temperature case using 
T = 300K. The factor of pc? = C,,, where C}; is the elastic constant for LA phonons. The 
values for l, are several microns. 
Also included are data for the polar scattering by optical phonons, which produces a 
lifetime Tọ, which can be converted to a mfp using the thermal velocity vp = ./2k,T /m*. 
Energy relaxation in semiconductors is discussed using the formula (8.144) 
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At low temperature the phonon occupation number ng = 0 and the only process is phonon 
emission. Assuming that the particle has enough energy to emit phonons, the above formula 
gives 


dE D?m*k? VEK 
(=) =e (8.167) 
dE l (dE Ez 
— )—=—{—)=-——*% 8.168 
E ) Uk (i ) lye l ) 
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— = — (8.169) 
Lye th* p 
The mfp path for energy relaxation /,, from phonons is quite long. Some values are shown in 


Table 8.1. Typical distances are millimeters. Most semiconductor devices are much smaller 
than these length scales, so that electrons do not achieve energy relaxation in traversing most 
devices unless there are sufficient impurities to cause them to random walk this distance. 
The other interesting case is for high temperature. Expand ng = kgT/o, —5 and find 

dE D kpT 

—)= -2n| daw) [e FO — oF FO) +2 [PFO + PFO] 

dt 0 ho 
The last term gives the same mfp /,,, as was found for zero temperature. The first term, which 
is proportional to kgT, is nonzero when the limits to the frequency integrals are taken to be 
2c,(k + k,), so the integral is 


2c,(k—k,) 2c,(k+k,) 

l doo? — l dow = (8.170) 

— 1 (2c ik +k) — (k — k] = 16k m* c$ (8.171) 
dE l 
——\ =— fe, — 2k 8.172 
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The energy relaxation has the same formula l, for the mfp at room temperature as at low 
temperature. The only change at higher temperature is that the energy £, relaxes to the value 
2kgT, which was zero at zero temperature. 


TABLE 8.1 Electron mfp data at T = 300K. The top lines have material constants. The second group of lines has 
estimated mpf from optical phonon scattering. The last two lines are from deformation potential scattering by LA 


phonons 

Units GaAs InP InAs InSb 
m* Me 0.064 0.078 0.027 0.013 
Ep 12.8 12.5 15.15 17.7 
Ez 10.9 9.5 12.25 15.7 
Cii GPa 118 101 83 66.7 
p g/cm? 5.32 4.81 5.67 5.78 
horo meV 35.4 42.8 29.6 23.6 
D eV 8.0 6.4 6.0 14.0 
OL 0.067 0.125 0.055 0.020 
To ps 0.80 0.51 0.85 2.09 
Ur km/s 378 341 581 837 
UrTo um 0.30 0.18 0.50 1.75 
Lat um 2.0 1.8 13.9 8.8 
l mm 3.2 2.5 80 134 
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8.3.3. Temperature Relaxation 


The relaxation time for temperature is required when the electron temperature T, and the 
phonon temperature T, are different. The electron-phonon interaction will allow energy 
exchange between these two systems, and gradually bring them to the same temperature. The 
rate at which they equilibrate defines the temperature relaxation time. The result for metals 
was introduced by Allen (1987). The same quantity is important in semiconductors. There are 
several situations where an experimentalist might encounter T, # T,. Since the electrons 
absorb electromagnetic radiation, an intense laser pulse could raise T, above T,. Similarly, a 
strong dc electric field could accelerate electrons to have an average kinetic energy well above 
the phonon temperature, and then the two systems would mutually relax to the same 
temperature. 

The starting expression for this calculation is the collision term (8.137) in the Boltzmann 
equation. The expression is simplified with the following assumptions: 


e The electrons are in thermal equilibrium among themselves with an effective 
temperature T,, Be = 1/kgT, and distribution function f (p) = np(§,). This relative 
equilibrium is maintained by rapid electron—electron scattering. 

e The phonons are in thermal equilibrium among themselves with an effective 
temperature T,, P, = 1/kg7,, which is maintained by rapid anharmonic phonon- 
phonon scattering. They are described by a distribution function ng(@,). 

e The electrons and phonons exchange energy according to (8.137). 


Always keep in mind that the boson distribution function ng(œ) is at a different temperature 
than the electron distribution functions n;(€), np (£). The expression can be further simplified 
by using the function «*F(w) to express the integrals over the phonon states 
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where € = Ep, & = €),,. Next, the change in the internal energy is calculated. The change in 


energy is accomplished by inserting under in the integral the factor of £’ — £ = tha. 
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In order to average over the entire system of electrons, it is necessary to also integrate over de. 
If N(e) is the density of states for the electron system, the energy change in the electron 


system is 
dU dTe dE(s) 
T =C- = fa eN (e (Fe ), (8.176) 


where C is the heat capacity. The integrals over de all converge within a thermal energy of the 
chemical potential. On this small energy scale, the density of states N (£) can be taken to be a 
constant N(0), where the zero of energy is the chemical potential. Then all of the integrals 
over de have the typical form 


I(@) = | dene —n,(€ — œ)]| = = WNp,(@) (8.177) 
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The subscript “e” is added to the boson occupation function ng,(@) to emphasize that it is 
evaluated at the electron temperature T,. The above result can be derived by changing vari- 
ables of integration to s = exp(B,&), ds = sB,de, t = exp(B,@) which changes the integral to 


0° . ds kpT. s+ 1 °° 
I(@) = kT ——— = —|] = 
(o) B | (s + 1)(s + t) eho — 1 n + |. ON ge(@) 


The other integrals over de are evaluated in a similar fashion. The result for the energy 
relaxation is 


aT, 
C dt 


—4nh?N of dow’ a’ F(w)[nz.(@) — n Bp(©)] (8.178) 


The final formula is quite simple. The right-hand side of this equation obviously vanishes in 
equilbrium when T, = T,. Another simplification occurs because the heat capacity is 
proportional to the density of states C = n?k2TN(0)/3 which simplifies the expression to 


dT, 128? 
dt  —s- nkT, 


N dww’ a’ F(o)[nz.(@) — Ng (©)] (8.179) 


The only factor which relates to the particular solid is «?F(). 

There are two interesting limits when evaluating this expression. The first is at very low 
temperature, T, < ©, where the Debye temperature is ©. Then the integral over œ has its 
main contribution at small values of œ. The small values of frequency come from sound 
waves, where it is suitable to use the Debye approximation. The evaluation of «?F (œ) in this 
limit is identical to the semiconductor case, which gives that «?F = yœ? where y is a 
constant. The frequency limits can be extended to infinity, which gives the expression 
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The relaxation obeys a T° law, which means very little heat is exchanged between electrons 
and phonons at low temperatures. 
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The other interesting limit is at room temperature, or at least above the Debye 
temperature. Then the boson occupation factors can be expanded ng © kgT/hw which gives 
the simple result 


aT, (T; — T,) 


(8.183) 
dt Tyr 
1 6h 
— = (a? 184 
tp nksT, (œ^) (8.184) 


At high temperature the temperature relaxation is determined by the temperature difference 
ôT = T, — T,, as well as by the function A(@2). The latter function also determines the 
energy relaxation. 

The temperature relaxation is measured by lasers using pulse-probe techniques. The first 
laser pulse is absorbed by the electrons, which thermally excites them, which causes their 
temperature to differ from the phonons. The time-delayed probe measurement determines 
how some property, such as the refractive index, varies with the time interval after the initial 
pulse. An observation of an exponential relaxation curve is a measurement of 1,7. 

The above result is valid for a metal. A similar derivation can be done for a semi- 
conductor, which is assigned in the problems. The same calculation for scattering of electrons 
in a semiconductor, by optical phonons, can be done in a interesting and rigorous way. For 
optical phonon scattering in a semiconductor, all of the phonons have the same energy fw, so 
there is no need for an integral over frequency. In fact, just set 


x F(©) = 0 30 — Wp) (8.185) 


where A is the dimensionless coupling constant. Starting from (8.173) gives 


(=) = —TAWo{[ (€)(No + 1) — Nof (E€ — @)IO(E — %0) 


+f(E)No — f (€ + @p)(No + 1} (8.186) 


The states f(e + lœ) can be regarded as a ladder of energy states, where each level is 
separated by œp. The above equation can be solved exactly by matrix methods. Define 
fi =f(€ +/@_) where the energy £ is now limited to the range 0 < € < Wy. The vector f has 
fı as its elements. The above equation can now be cast into a matrix equation, where No is 
the occupation number of the optical phonons. 


d - 1. 
“ f=-—M-f (8.187) 
t oT 
Ny -M +1) 0 
M=\|-N, 2N,+1 -M+ -- (8.188) 
0 —Np 2N) +1 
1 
ToT 


The matrix æ has tridiagonal form. Only three rows are shown, but its dimensionality is 
infinite. All row except the first have elements: —Ny, 2Ny + 1, —(No + 1) to the left of the 
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diagonal, on the diagonal, and to the right of the diagonal. If the matrix æ has eigenvalues Ej 
and eigenfunctions wy; then the solution is 


f= DCA or (8.190) 
J 


where the coefficients a; are determined by the initial conditions. The equilibrium distribution 
is given by £ = 0, WV) = exp(—/B@,), where B is determined by the phonon temperature. The 
system relaxes towards the equilibrium distribution, which is why its eigenvalue has to be 
zero. The other eigenvalues and eigenvectors are given by Mahan (1985). 


8.4. ELECTRON-PHONON INTERACTIONS IN METALS 


8.4.1. Force—Force Correlation Function 


In pure metals the electrical resistivity has two components. There is usually a constant 
resistivity from electron scattering by impurities, which is the largest part of the resistance at 
small temperatures. There is also a resistivity from electron scattering by phonons, which is 
temperature dependent and becomes large at high temperature. “Matthiessen’s rule” (1862) is 
that these two contributions to the resistance are additive. It should be regarded as a rule of 
thumb, rather than an ironclad rule. There are enough “deviations from Matthiessen’s rule” to 
make the abbreviation DMR a familiar acronym (Bass, 1982). 

Electron scattering by acoustical phonons presents a hard problem in transport theory. 
The scattering is slightly inelastic. This problem is solved using neither the elastic scattering 
theory of Sec. 8.1 nor the inelastic scattering theory of Sec. 8.2. Instead an integral equation 
must be derived and solved for the energy dependence of the scattering process. The slightly 
inelastic nature of the scattering process makes this calculation much harder than the previous 
cases. 

Two methods for obtaining the electrical conductivity are emphasized in this book. One 
uses equilibrium methods and evaluates the Kubo formula for the current—current correlation 
function. The resistivity from phonons will be found using this method, which follows the 
original derivation by Holstein (1964). The second method utilizes the quantum Boltzmann 
equation (QBE), which is a nonequilibrium theory. Mahan and Hansch (1983) used the QBE 
to derive the Holstein formula. Both of these derivations are complicated. They end by 
deriving the same integral equation for the scattering function, which must be solved by 
further work. Their virtue is that they are formally exact starting points, although approx- 
imations are made in obtaining the solution. 

Other methods for obtaining the resistivity have been proposed, partly to avoid all of the 
work associated with the exact methods. These other methods are approximate. However, 
often the theories are both simple and accurate, which make them useful approximations. One 
of them is the force—force correlation function. If F(A is the fluctuating force that acts on the 
electron, then define R(iw) as the force—force correlation function: 


B 
R(iw) = — : I de" (T F(t) + F(0)) (8.191) 


_ li. [imR,.(@) 
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This formula is just the quantum analogy of the Nyqvist theorem (Kubo et al., 1985). After 
calculating this correlation function, the retarded function is obtained by letting ia > œ + ið. 
The resistance p is found by dividing by œ and taking the limit of œ — 0. For example, 
assume that the force on the electron has two terms: F; from impurities and F,;, from phonons. 
If they are uncorrelated, then the correlation function has no cross terms. Symbolically write 


R = (F; + F,,)° (F; + Fyn) = (Fi Fi) + (Fpa Fon) (8.193) 


In this case the resistivities from impurities and phonons are additive, in agreement with 
Matthiessen’s rule. 

As an example, the resistivity is calculated from impurity scattering. The potential 
energy of the electron scattering from the impurities at R; is discussed in Sec. 4.1.5 


V(t) = E Valt- R) =~ Vq)explig: (r — Ri) (8.194) 
i iq 


The force F is the gradient of the potential. The factor of exp(iq - r) can also be written as the 
electron density operator p(q). The factor of exp(—iq:R,) can be written as the impurity 
density operator p,(—q) 


FE) =~ E aA oplig: (rR) = -E E arD) 
iq i 


The next step is to evaluate the force—force correlation function R(iœ). In correlating F with 
itself, there are two separate factors. One is (p,(—q)p,(—q’)), which equals the number of 
impurities N; if q = —q’. The other is the electron density—density correlation function, which 
is given exactly in terms of the inverse dielectric function 


N, 2 
Ro) =~ Era] 


—] 8.195 
a (6195) 
The next step is to take the imaginary part of the retarded function. The only retarded function 
on the right-hand side of the above equation is the inverse dielectric function £ = €, + ie». Its 
imaginary part is —e, /(s? + £2). At low frequency £, = 2we*m’/q’. Dividing by œ eliminates 
the factor of frequency. The formula for the resistivity from impurity scattering is 


_ n;m? (<2 V(q) 
~ 6mnze? (Qn) &(q) 


This formula is the exact result for the zero-temperature resistivity from impurity scattering, 
when the scattering is calculated in the second Born approximation. If V(q)/€(q) is replaced 
by the 7 matrix for scattering, then it is the exact result, period. It is the formula 
p = m/(noe’t,), where t, is defined in (8.25). It even includes the factor of (1 — cos 8°), 
although this assertion is not immediately obvious. One has to perform the angular integral in 
(8.25), which eliminates the delta function, in order to show its equivalence with the above 
formula for the resistivity. 

The force—force correlation function gives the right resistivity for impurity scattering. No 
vertex correction or integral equation was needed in the derivation. The ease of derivation has 
made this approach popular. 

Several caveats are needed. One is that impurity scattering is the only known example 
where the force—force correlation function gives the correct answer. In other cases it give an 
approximate answer. The second caveat is that the right answer is obtained by a wrong 


2 


(8.196) 
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derivation. The derivation contains two important limits. One is setting the volume v > ov, 
while the second is œ — 0. The above answer is obtained by taking these limits in the wrong 
order. If they had been done correctly, in the right order, a different answer is obtained. Using 
the right order in evaluating the force—force correlation function gives a zero result as œ — 0. 
These points are discussed by Argyres and Sigel (1974), Huberman and Chester (1975), Kubo 
et al. (1985), and Fishman (1989). 

The force—force correlation function may also be evaluated for the electron scattering by 
phonons. The result is 


a d 
AT) =C F faaroo e a -2 (8.197) 
Sh 
E TIAR (8-198) 


Here W(q) is the screened electron—ion interaction and vp is the unit cell volume. This 
formula was first derived by Ziman (1960) as a variational solution to the Boltzmann equa- 
tion. It is the formula that is most often evaluated when calculating the temperature depen- 
dence of the resistivity of metals. Figure 8.8 shows a theoretical calculation of Dynes and 
Carbotte (1968) compared with experiments for Na and K. An important feature of these 
calculations is numerically integrating over the Brillouin zone for all the phonon states, while 
employing accurate values for the phonon frequencies @,(q) and polarization vectors &. 


10°p (ohm-cm) 


© This Work 
— Experiment 


40 60 80 100 120 140 ~~ 160 
T(°K) 


FIGURE 8.8 Resistivity as a function of temperature for Na and K. The solid line is experiment, and points are 
theory. Source: Dynes and Carbotte (1968) (used with permission). 
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8.4.2. Kubo Formula 


A rigorous theory of the dc electrical conductivity is calculated for the scattering of 
electrons by all phonons: acoustical and optical. The starting point is the Kubo formula, and 
the derivation follows the original one by Holstein (1964). His theory sums the ladder 
diagrams for phonons and reduces the vertex function to an integral equation which is solved 
numerically. So far a solution is available only for a spherical Fermi surface, so that the result 
is the Kubo formula analogy of (8.198). The results are expressed in terms of the McMillan 
function «”F(w) and a similar function «?F(), which is used in transport theory. 

The goal is to evaluate the current—current correlation function in the presence of the 
electron-phonon interaction. This correlation function can always be expressed as a product 
of two Green’s functions and the vertex function. The Green’s function Y(p, ip) used here 
represents fully interacting particles, with a self-energy found from the electron-phonon 
interaction plus any additional interactions of interest. The Feynman diagram for the corre- 
lation function is shown in Fig. 8.9 where the vertex function is put only at one end of the 
bubble in order not to overcount the vertex terms: 


1(P 
no) = =z; | dret TiO O) 


e? io,t 
=- Eel dre" (T.Cha(t)Cpg(t)Chg()Cyre(0)) (8.199) 


2e dp 1 
nlio) = =| p ip)G(p, ip + iœ)p > T (p; ip, ip + iœ) (8.200) 


OBG 


The dc conductivity is found by the same steps used in Secs. 8.1 and 8.2. One evaluates the 
correlation function for values of iœ and analytically continues iœ —> œ + ið to find the 
retarded function. The dc conductivity is the imaginary part of the retarded function divided 
by œ, in the limit where œ — 0. 

The vertex function I(p; ip, ip + im) is evaluated below. It depends on both the 
frequency variables ip and iw, which is written in the combination (ip, ip + iw). The two 
arguments ip and ip + im come from the electron Green’s functions which have the same 
frequency arguments. In a homogeneous electron gas, the vector vertex function 
I'(p; ip, ip + im) must point in the vector direction p, although in real metals the crystalline 
potential defines other possible directions. However, the approximation of treating the Fermi 
surface as strictly spherical is equivalent to neglecting crystal directions, so assume I points 


ip +iw 


FIGURE 8.9 
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in the direction p. It is convenient to introduce the scalar function y(p; ip, ip + iw), which is 
the amplitude of the vector vertex function: 


F'(p; ip, ip + iœ) = py(p; ip, ip + iw) (8.201) 
2e f dp , 
io) = —— ; 8.202 
nio) = 53 | SPsP*SE. io) (8.202) 
S(p, iœ) = B> F(p, ip)A(p, ip + iw)y(p; ip, ip + iœ) (8.203) 
ip 


The scalar function y is not the same as the scalar vertex function in (8.81). 

The next step is to evaluate the summation over Matsubara frequencies ip to obtain 
S(p, iw). To this end, construct the usual contour integral which has cuts along the axes where 
ip —> real and also ip + iw — real. These series of algebraic operations are the same as used 
to derive (8.64) for impurity scattering. That earlier result can be used here by identifying 
P= GGy 


CO 


d 
Im[S,.(P, ©)] = rel | Emeto — ne) 


~œ 
x [Gaay(P, E)Gret(P, E€ + œ)y(p; €— id, E ++ ið) 
— Gret(D, E)Gre(P, € + O)Y(P; € + ið, € +O + a) (8.204) 
and the conductivity is 


2e fdp a de| dnp(e) 
3m? l Om Joo 2n| de 


x {lGre(p, £)I7Y(p, € — ið, € + iò) — Re[G,(p, €) Y(p, € + ið, e + i8)]} (8.205) 


Equation (8.205) is exact for the exact vertex function. It is expressed in terms of the two 
functions y(p; € — ið, e + ið) and y(p; € + ið, e + ið). These two functions are expected to 
be quite different, as they were in Sec. 8.1 for impurity scattering. The function 
y(p; € + ið, € + ið) could be obtained from a Ward identity. That is true here, as first shown 
for the electron-phonon system in metals by Englesberg and Schrieffer (1963). This Ward 
identity can be expressed in terms of F(p, e + ið, e + ið) or the scalar y(p, € + ið, e + ið): 


F'(p; £ + ið, £ + œ + iò) = p + mV X(p, £) (8.206) 
y(p; € + ið, € + œ + ið) = 1 + 5 20. €) (8.207) 
p 
0 
Gre(P, €) (p; € + 15, € + © + iò) = — Gea (P, £) (8.208) 


lems 


For metals, the electron-phonon system has the feature that the self-energy function X(p, £) is 
not very p dependent, and the derivative of È(p,£) with respect to Ep is small: 
dx/d&, ~ &/E- <1. A good approximation is to set y(p, € + ið, e + ið) = 1. 
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There is another result which is even stronger, since it applies for all values of £ near the 
Fermi energy. Since the self-energy ÈŁ(p, €) does not have significant p dependence near 
DP ~ kp, call it X(k;, £) = Re[X(e)] — il (e) where the p notation is suppressed. The function 
r(e) = —Im[}(kr, £)]. The retarded and advanced Green’s function have their important 
dependence on p through the kinetic energy term, § = &p. Define Q(e) = e — Re[X(e)]: 


1 1 
Gre(p, €) = e — Ẹ — Refè (e)] + iT (e) ~ Oe) — € + iT (e) (8.209) 
Gaav (P, e) = l l (8.210) 


e—& —Re(X(e)] iT (e) Ae) -E iT(e) 


Similarly, the vertex function y(p; € — ið, € + @ + ið) is assumed to not have a significant 
dependence upon p except on the order of kp. This assumption will be justified later. The 
second term in brackets in (8.205) can be neglected; it vanishes by doing the kinetic energy 
integration. By neglecting terms of order O(€/E;,) then 


dpp? = 4np*dp = 4nmk}-dE[1 + O(6/Ep)] = 120? mngdé (8.211) 


where the electron density is ny = k}./3n?. When the kinetic energy integral is evaluated, the 
only € variation is in the Green’s functions. Of the three combinations which occur, only one 
makes a nonzero contribution: 


| Eau, Gup) = 0 (8.212) 
G Gap, £’) = ont 8.213 
B SGret(P: €) aP E) = 07) 0) + PO rO] (8.213) 
| OP Gur.) = 0 (8.214) 


Each integral is evaluated by closing the contour at infinity. The two integrals which vanish 
have both their poles in the same half plane (upper or lower), so the integration contour can be 
chosen to avoid them both, which encircles no poles and hence gives zero. The integral over 
the combination G,,,G,., has one pole in each plane. Closing the contour always picks up one 
pole whose residue produces the result. By using these integration results, the dc conductivity 
(8.205) becomes 


_ enf AE d 
oOo = Fm | ei |- Tno] (8.215) 
A(£) = y(kp, € — ið, € + ið) (8.216) 
T(e) = —2 Im[E(kr, £) (8.217) 


Equation (8.215) is the final result of the formal derivation. There only remains the evaluation 
of the vertex function A(£) and the imaginary self-energy I (e). If the vertex function were 
absent (A = 1), then the evaluation would be easy. The imaginary self-energy I (e) has been 
evaluated for many metals, and a result was given in Fig. 7.14 for Pb. The quantity 
1/21 = t(€), where t(€) is the relaxation time defined as the average time between scattering 
events. The result (8.215) when A = 1, is just the average of the relaxation time over the 
thermally smeared Fermi distribution. Of course, the earlier solution for impurity scattering 
showed that the neglect of A(€) is a serious error. The vertex function A(e) serves the 
important role of weighting the scattering events and favoring those at high momentum 
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transfer. For impurity scattering, A = [/T, or A/T = 1/T,, where T, is the scattering rate 
which contains the equivalent of (1 — cos 6’). 

The vertex equation for phonon scattering is now solved to derive I’,. Since the phonon 
scattering is inelastic, the results are not identical to those for impurity scattering. In summing 
the ladder diagrams for the vertex function, however, the vertex contributions are found to be 
important and significantly different from unity. Migdal’s (1958) theorem, which asserts that 
vertex terms are unimportant, is contradicted. 

The vertex function I (p, ip, ip + iœ) is calculated by solving the integral equation 


F(p, ip, ip + iœ) =pt+—- P pe (ony LT M,(q) F(a, iq) (p + q, ip + iq) 


x lp +q, ip + iq + io (p + q, ip + iq, ip + iq + io) (8.218) 


This vertex sums the ladder diagrams for phonons. It is illustrated in Fig. 8.10. Iteration of 
Eq. (8.218) produces a series in which each additional term has one more ladder diagram. The 
solution to the integral equation produces an expression which contains all terms with any 
number of phonon ladder diagrams. This solution is not an exact evaluation of the vertex 
function I'(p, ip, ip + iœ), since other vertex contributions occur which are not ladders but 
have the phonon lines crossed. One expects these terms to be smaller, but detailed calcula- 
tions are lacking, so this is only a supposition. 

It is unfortunate that the vertex function is a vector. The scalar vertex function obtained 
by replacing p by 1 in Eq. (8.218) is easily obtained from a Ward identity: 


rP pipt g SMa) HA, ig 90+ 4 ip + ia) 
x Gp+q,ip+iqg+io)(p+4q, ip + iq, ip + iq + io) 
=1]1— Ep, ip + iœ) — X(p, ip)] (8.219) 
The Ward identity is not helpful for our problem with a vector vertex function. The integral 
equation must be attacked and solved. 


Define the scalar function y(p, ip, ip + iœ) as the scalar amplitude of the vector vertex 
function I = py. It obeys the integral equation 


M, (q 2 q) E 


1(.ip,ip + io) = 145 [£4 


p:(p+q) 
ig,» p” 


(27) 
x Gp+q)Gpt+qt iol (p + q, ip + iq, ip + iq + iw) 
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ip +iq 


FIGURE 8.10 
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This equation is not the same one which is obeyed by the scalar vertex function in (8.219), 
and these two functions are quite different. 

First do the integrals over angles and wave vector. First write d*>g = 2nq*dgqdv, where 
v = cos is the angle between p and q. The angle variable is changed to p,, defined as 


pi = (p+? =p’ +E + 2pqv (8.220) 
dy -2% (8.211) 
pq 
2 oo [p+q\ Phin co Ép+q 
l dq = =| qdq | pdp == | qdq | dé, (8.222) 
P Jo Ip—al Vo Jo Eng 


Next uncouple the limits of integration. Only electrons at the Fermi surface contribute, so that 
values of 5; = £, — p © 0 and €, ~ 0 are important in the integration process. Approximate 
v, = vp as the Fermi velocity. Similarly, the integral qdq is understood to be over the spherical 
Fermi surface from one point p to all other points p,, where p and p, both have magnitude kp: 


E + -fe a| Ñi (8.223) 


The limits on the dE, integral are extended between oo, since most of the integrand has 
large q values, where the actual limits on €, are from a very negative number to a very 
positive one. Since the main contribution is in the region €, % 0, this error is small. Consider 
the other angular factors in the integrand: 


PeO+d _ 142 - +yz -p — q] (8.224) 
p p 
2 
q G1 Sp 
~1—-4 40(S1, 2 8.225 
met Ee ze) (5.223) 


The terms &;/Er, &p/Er are neglected, since it is expected that &; and &, will be small—on 
the order of a Debye energy. On the other hand, the factor g”/2k2 need not be small, since the 
integration over phonon states has a significant contribution from high values of q near the 
edge of the Brillouin zone. This term must be retained. The difference in treating the factor 
(p? — p’)/k?, which is neglected, and q?/kz, which is retained, is that the former enters the 
average over electron states and the latter over phonon states. The integrals over the phonon 
wave vector can be expressed in terms of the function «*F(w), which was introduced in Sec. 
7.4: 


dP 
F(@) =- ZE | @ > (8.226) 
BF) = ESS Mla gilo- (0) (8.227) 


The first of these is just the McMillan function, which was defined in Sec. 7.4. Since the 
Fermi surface has been assumed to be spherical, «?F has the same value at each point on the 
surface, and the p subscript is omitted. The other form of coupling «?F is called the 

“transport form of alpha-squared-F,” which was introduced by Allen (1971). It differs from 
the McMillan form by having the additional factor of g*/2k? in the integrand, which gives 


542 Chap.8 e dec Conductivities 


more weight to the scattering processes at large wave vector. The factor of q°/2kż is identical 
to (1 — cos 6’) when the scattering is elastic. 

After completing all these angular and wave vector integrations, the vertex function 
Y(p, ip, ip + iœ) is found to be not very dependent on p. The only variation is as a function of 
p, which can be set equal to kp with an error of only &,/Er. Define y(ip, ip + im) = 
y(kr, ip, ip + iœ) and arrive at the equations: 


(A) 


y(ip, ip + iœ) = 1 + | "dujo? F (u) — 07 F(u)] 
0 


x | dé, S(E,, u; ip, ip + io) (8.228) 
o, I 2u a, 
S(§, u; ip, ip + io) = = a VP + ig, ip + ig + io) 
P iq (iq) — u 
x GE, ip + ig)G(E, ip + iq + iw) (8.229) 


This integral equation is not too difficult to solve. The basic approximation has been to 
decouple the integrations over dq and dé,, which permits all the phonon information to be 
collected into the functions «°F, «?F. The primary assumption in this decoupling is that the 
Fermi degeneracy Ep is very much larger than other energies such as kgT or œp. The present 
integral equation is actually much easier to solve than the one for polarons in Sec. 8.2, since 
there the integration variables cannot be accurately decoupled in the same way. 

The preceding is the basic integral equation which needs to be solved for the vertex 
function. The € dependence of S is only in the Green’s functions G(E,ip+iq) and 
G(E, ip + ig + iœ). This integral is done later, using the result (8.213) that only the integral 
over the pair G.,Gaay = |G|? is nonzero. The factor 2u/[(ig)’ — u?] is the phonon Green’s 
function for a phonon of energy u. The next step in the derivation is to do the summation over 
Matsubara frequency ig, which is done in the usual way by constructing a contour integral. 


dz 2u 
bo z np(z)y(z + ip, z + ip + iœ)G(z + ip)G(z + ip + iœ) 


The contour in integration is a circle at infinity. The integrand has poles from the phonon 
Green’s functions at z = +u and cuts along the axes where the electron Green’s functions are 


real, z = —ip + € and z = —ip — iœ + £’. The contour integrals from these three contribu- 
tions are 
S = Si +S +83 (8.230) 
Sı = nglu)y(lip + u, ip + iœ + u)G(ip + u)G(ip + im + u) 
+ [ng(u) + 1]y(ip — u, ip + im — u)G(ip — u)G(ip + iœ — u) (8.231) 


dg’ 2u 
S — ——— G / : / . / ° / 
2 l mie ip i Tp we (e +iw)[y(e + ið, € + 10)G,(€ ) 


— y(e’ — ið, £ + iw)G,,4,(€’)] (8.232) 
de! 2u 
S — _ / _ 7 / _; / ° / 
= [Fripp as Oe tM! — i0, e + DG) 


— y(' — iw, £ — i8)G,,4,(e’)] (8.233) 
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The next step is to perform the analytic continuations. The final result should be A(e) = 
y(e — ið, e + ið). First set ip —> € — ið. The next step is to set ip+imw —> £+ io > 
e +0 +ið > e+ið, since œ — 0. The Green’s functions become advanced or retarded 
according to the side of the cut, so this analytical continuation produces 


S = ng(u)ACE + u)|GialẸ, € + u)? + (ng) + ACE — WIG (6, £ — u)? 


+ | z np(E') [ Gret(§, £) PAC) (Ge ere) 


(£ —e— ið -u (£ —2 + 18) — u 


+ OC, a) (8.234) 


The next step is to do the integration over d&,. According to (8.213), this integral eliminates 
all combinations of the Green’s functions except G.eGaay = |Gre|”, Which removes the terms 
O(G, G2,,). Also note that the factor below is the phonon spectral function 


adv 


2u 2u 
—__,_. — —___,___ = 2nf&(e’ — e — u) — 86’ - e + 
(e/ —¢ —i8) — u (£ — s+ i8) — u [ ( u) ( u)| 


which eliminates the integral over de’. These manipulations provide the final form of the 
integral equation for A(e). It was first derived by Holstein (1964), although his result is 
modified by expressing it in the «*F' formalism: 


A(eé) = 1+ rf aloe F (u) — 0? F(u)] 
0 


A A(g — 
; it tng +e + fg) +1 = nre- u) PER] (8.235) 
meP AO] ane 
~ Om [acne de | 


The integral (8.215) for the dc conductivity was rewritten in order to present the two 
important results together. The integral equation for the vertex function A(£) must be solved, 
and the solution is used in the integral over £ for the conductivity. Recall that A(e) is a real 
function, so the integral equation is not complicated. The form of the equations suggests the 
unknown quantity is actually t(¢) = 2A/T, which might be called the effective relaxation 
time for transport. 

The energy uncertainty I (e) of the electron from the electron-phonon interaction was 
given previously in (7.305). The equivalent result for the transport kernel is: 


I (e) = —Im[2] = n| dua F (u)[2ng(u) +n-(u + £) +n-(u — £)] 
(8.236) 


(A) 


Te) = n| duo? F(u)[2ng(u) + nele +u) + np(u — 2) 


0 
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If the self-energy function I’ and the vertex function A(e) were both constants and inde- 


pendent of £, the solution of the vertex equation (8.235) would be easy: 


A=1 +å -T (8.237) 
(8.238) 


T 
=F 
This model gives A/T = 1/T, so that the conductivity integral has only the transport form of 
the relaxation time. The earlier solution to impurity scattering also assumed that A/T = 1/T,,. 
For impurity scattering the functions I (£) and A(é) are usually insensitive to £ near £ ~% 0; the 
exception is when the impurity has a scattering resonance near the Fermi surface or if the 
density of states is not smooth. However, for the electron—phonon interaction in metals, it is 


not a good approximation to treat I (e), I’,(e) or A(€) as constants. The calculated results in 
Sec. 7.4, shown in Fig. 7.14 for Pb, illustrate that I (e) has substantial energy variations near 


the Fermi energy. The vertex function does also. 

Takegahara and Wang (1977) evaluated (8.235) and (8.215) for metallic rubidium and 
cesium. Their results are shown in Fig. 8.11. In each case the solid line is calculated assuming 
that the ratio T/T, is a constant, while the dashed line is calculated by solving (8.235) for the 
e dependence of A(e). The latter curve is in very good agreement with the experiments, which 

are indicated by the points. The differences between the solid and dashed curves are similar to 
the differences between the Ziman formula (8.198) and the Holstein formula (8.235). Note 
that there is no region with a well-defined T° law for the resistivity in these metals. 


1.07 (a) Rb k 1.OF (b) Cs 4 
£ 0.8 g 0.8 ze 
N A ° 
a 0.6 eo 0.6 Y 

10-7 © 0.4 o 

d 10-7 af 
j 
J j 
j 
j 
1078 i i 
q 1078 H 
H lj 
i ) 
~ I j 
E I l 
o -9 I j 
c 10 i ‘ 
x ! 10-9} 4 
1 i 
q i 
40 -10L_t 
1 5 10 50 100 { 5 10 50 100 
T (K) 


FIGURE 8.11 Calculated constant volume phonon limited electrical resistivity for (a) Rb and (b) Cs. Solid lines are 
calculations that treat A(€) as a constant. Dashed lines are full solution to integral equation for A(e). Points are 


experimental values. Source: Takegahara and Wang (1977) (used with permission). 
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8.4.3. Mass Enhancement 


The electron-phonon mass enhancement factor 1 was introduced in Sec. 7.4. It is from 
the real part of the electron self-energy due to the electron-phonon interaction. Since this self- 
energy was found to be energy dependent but not very wave vector dependent, the electron 
effective mass m* is approximated by 


M@) = — ~ L(kp, œ) (8.239) 
m 1 
T= 1AA) = z (8.240) 


The mass enhancement factor is also related to the quasiparticle renormalization factor Z(œ). 
The values of these quantities at the Fermi energy œ = 0 are à and Zp. The mass renor- 
malization factor can also be expressed as an average over the Fermi surface of a weighted 
average over the phonon density of states. This definition is given in Sec. 7.4. Values of à in 
real metals range from 0.1 to 3. 

An important question is the role which à plays in the dc transport properties. The most 
obvious approximation is to use the effective mass m* = m(1 + à) in transport formulas 
whenever classical theory says to use m. This substitution would make sense, since m* is the 
effective mass which governs the motion of electrons on the Fermi surface, and these are 
involved in dc transport properties. However, this sensible procedure is wrong in most cases. 
The important point is that the factor 1 + à can enter the final formulas in several ways. It 
changes the effective mass, relaxation time, and quasiparticle renormalization factor Zp. The 
formula for the transport coefficient will have a number of factors of 1 + A. Often they all 
cancel, which is the case for the electrical conductivity. 

Prange and Kadanoff (1964) investigated which transport measurements were influenced 
by the electron-phonon mass enhancement factor 1 + à. They concluded that the enhance- 
ment did affect the following measurements: specific heat, low-field cyclotron resonance, and 
the amplitude of the deHaas—van Alphen effect. The following measurements are not affected: 
dc electrical conductivity, thermoelectric power, thermal conductivity, the period of the 
deHaas—van Alphen effect, spin susceptibility, and the electron tunneling rate. Their 
conclusion on the thermoelectric power was challenged by Opsal et al. (1976), who detected 
a dependence on 1 + À. The list of quantities which are affected is much shorter than the 
list of quantities which are not affected. The usual case is that the transport property 
is not influenced by the mass enhancement factor; see Grimvall (1981) for a further 
discussion. 


8.4.4. Thermoelectric Power 


The thermoelectric power is a subject which is seldom discussed in most solid-state 
courses, as the lecturer is busy treating subjects which are more fashionable. This tendency is 
regrettable, since it is an important measurement. Experiments show that diverse behaviors 
are found for simple metals, and even the sign of this quantity shows no regularity. The low- 
temperature theory in metals is poorly understood, although there has been extensive work. 

The thermoelectric power was introduced briefly in Sec. 3.8. It is a simple measurement, 
at least conceptually. A conducting bar is insulated so electrical currents cannot exit from its 
ends, and then a temperature difference AT is maintained along the length of the bar. The two 
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ends of the bar are found to have a voltage difference AV which is proportional to AT. The 
constant of proportionality is the thermoelectric power S: 


AV 1 L02 
LCD — kT P 1O,T(T i -i(0 8 942 
= iov |, te (T į(T) ° j(0)) (8.242) 
kpT 
LY = [ arom Tjo) - j(0)) (8.243) 


The thermopower is defined theoretically as the ratio of two correlation functions. One is just 
proportional to the de electrical conductivity o = BL“, which has already been evaluated for 
several models. The other correlation function has the argument of heat current jg and the 
electrical current j, where jọ = jg — (u/e)j. Equation (8.241) differs by a factor of charge e 
from its earlier version in Sec. 3.9, since earlier j meant the particle current, whereas now it is 
the electrical current. They differ by the unit of charge, which causes the change in (8.241). 

The correlation function L? is now evaluated for its dc value. Both correlation func- 
tions are evaluated for values of iw; then analytically continue iœ —> œ + io. Finally take the 
limit œ — 0 of the imaginary part of the retarded function. 

In the evaluation of most correlation functions, there is usually a leading term which 
provides the dominant part of the answer. There are numerous small correction terms which 
can usually be ignored. In calculating the correlation function LO? for the thermoelectric 
power, the dominant term vanishes, and one is left with obtaining all the numerous small 
correction terms. This feature makes it difficult to obtain an accurate answer. 

The heat current operator has many terms. The one which is expected to provide the 
dominant term is from the kinetic energy of the electron: jọ = >» VpSp Cho Cpo: This heat 
current operator is used in most theories. Similarly, for the electrical current the operator is 
j=e} v c} oCpo: The correlation function is called L“?”: 


1 


p029) — 
VBia 3 


7. 2 k» ral dre® ‘ (T. Clo ()Cko(1)Cio 


The important feature of this correlation function is the bracket containing four electron 
operators. This type of operator sequence was encountered in earlier sections of this chapter. 
It is evaluated as 


] 


los) — 
vB io = 


a) aD Ék 3 k-T(k, ik, ik + in)G(k)G(k + iw) 


where I°(k, ik, ik + iœ) is the vector vertex function of the bubble diagram. This same vertex 
function enters into the correlation function for the conductivity. Write it again as I = ky. 
There is the same summation over Matsubara frequency ik which was done in the previous 
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section for the correlation function L”. The result was given in (8.204). The retarded form 
of this correlation function is 


2e k © de 
Imir 22 =- e | de 7 
ml ret ] 3Bm2o (2n)° Ëk o an [np(E + @) nr(€)] 

x {Gagy(K, €) Gre (K, £ + @)y(k, £ — ið, € + © + ið) 

— Re(G,,.(kK, €)G,(k, € + o)y(k, £ + ið, € + @ + 16)]} 


The next step is to take the limit @ — 0, which causes the electron occupation factors to 
become dn;(s)/de. The integral over wave vector is changed to an integral over 
kdk = 12x? mngdé,: 


ime) = | ES SEO REG 8)? 


x y(k, e — ið, e + ið) — Re{G.,,(k, £) y(k, £ + ið, £ + i8)]} (8.244) 


Again it is assumed that the retarded and advanced Green’s functions are significantly 
dependent only on € in their kinetic energy term and that the self-energy and vertex functions 
have negligible dependence on €. Then the following integrals are evaluated by a contour 
integration in analogy with (8.213): 


D dEEGralE, 6)? = in 
nOQ(e) 


| aElGulE oP =F (8.245) 


|" deEGyy (b, €} = —in 


Two of the integrals equal tin, which comes from the semicircle closing the contour at 
infinity. These integrals give zero when taking the real part. The important contribution must 
arise from the combination |G,,,(6, £), since it has a nontrivial contribution from the integral 
over dé: 


(12a), _ My (©  [  dnp(s) | Q()AC) _ 
mL? = 3 [a E com a =? (8.246) 


This integral is zero after one evaluates the integral de. It vanishes because the integrand is an 
antisymmetric function of ¢. The quantities dnp(€)/de, A(e), and IT (e) are all symmetric 
functions of g, while Q(e) is an antisymmetric function. The correlation function L@*” is zero. 

The integral vanishes because of the single power of € in the d& integral. It makes the 
single power of Q(e) in the integral over de. The single power of & comes from the heat 
current operator. 

A nonzero result for the correlation function is obtained by repeating this derivation and 
retaining all the correction terms. One important term is from the argument of the wave vector 
integration: 


ka? k = Ink dk? = 3nmdé,[kp + (K — key”? 


3Ek 
and (1 + (8.247) 
F 


548 Chap. 8 e dc Conductivities 


The second term in parentheses makes a nonzero contribution to the thermopower. This term 
is now evaluated. Kinetic energy integrals such as (8.245) must now be done, except there is a 
factor of E° in the integrand which multiplies the Green’s functions. The evaluation of the 
integrand is tricky, since technically the integral diverges. At large values of &, then G ~ 1/€ 
and (EG)? ~ 1. The integrand does not fall off at large values of £, and taking the limits to 
-too gives an infinite integral. Usually this problem is solved by ignoring it. The product 
Gre |? = 4/21 ~ mO(G —¢)/T is replaced by a delta function for energy conservation. The 
integral over (ÉG,«) is set equal to zero: 


[EEE DP a F (8.248) 
The term in £? is thermally averaged according to (8.244) using G&R 3.531(3); 
o0) 2 
l dec?| -2| T (8.249) 
-o de 3 
mjt] = E er 8.250 
miee] = gge O) (8.250) 
k a ne keT 
S= -(% 2) im ]= -z3 nC) (8.251) 
CLF 


where the conductivity is © = nọe?t,/m. The dimensions of S are volts per degree. The 
parameter n(T) is dimensionless. So far n = 3 from the coefficient of &/Ep in (8.247). Other 
contributions to n(T) are derived below. 

Taylor and MacDonald (1986) evaluated this expression for the alkali metals at high 
temperature. Rather good agreement is obtained, as shown below. First it is necessary to find 
more contributions to n(T). At high temperature it is a good approximation to set T, = Tys, 
where the transport lifetime is entirely from phonons, and is given by the transport form of 
alpha-squared-f in (8.226): 


a 
Tot (k) 


Contributions to n(T) are obtained by expanding k about kp and keeping the first-order terms 
in (k — kp)/kp © &/2Ep. The prefactor of k~? gives another contribution of n = 3 to n. The 
integration limit of 2k gives a contribution to n of 


2k 2n 
On) are | T af do Y MK (anslo la] + arloa) 


2n 
T) =E tp] dd E MOK nalo Qk] + melo, 2h 


Another contribution to n(T) comes from the matrix element M, (q). It is usually calculated 
using a screened pseudopotential for the electron-ion interaction. The better pseudopotentials 
are nonlocal, which means they depend upon M,(k, k + q) rather than just on q. This k 
dependence can also be expanded around the point kp. The pseudopotential gives another 
contribution to n(T) which is called r(T). The contributions to n(T) are 


n(T) = 3 — 2q(T) — $r(T) (8.252) 


Table 8.2 shows the evaluation of these terms for the alkali metals at various temperatures. 
The comparison with the experimental data is good for Na, K, and Rb. The high-temperature 
thermopower seems to be understood in these cases. Both q(T) and r(7) are small for Na 
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TABLE 8.2 Thermoelectric parameters in the alkali metals. 
n = 3 — 2q — r/2 (Taylor and MacDonald, 1986, used with permis- 


sion). 
Metal TK) 247) 47/2 n 
Theory Experiment 
Li 424 9.26 —1.43 —5.33 —6.3 
Na 300 0.04 —0.09 3.05 2.9 
K 200 0.83 —1.87 4.04 4.0 
Rb 100 4.78 —4,49 2.71 2.8 


Cs 100 9.32 —7.15 0.83 0.0 


because the electron—ion pseudopotential is nearly zero at q = 2k,;. The thermopower is 
easier to evaluate at high rather than at low temperature. At high temperature the resistance is 
dominated by the electron-phonon interaction, which is well approximated by using the 
transport form of alpha-squared-f At low temperatures one also has to include the k 
dependence of impurity scattering, as well as the ordinary form of alpha-squared-f while 
solving (8.235). In addition, phonon drag is important at low temperature. 


8.5. QUANTUM BOLTZMANN EQUATION 


There are several different methods of doing transport theory. The theory used in the 
preceding sections uses the Kubo relation for the conductivity and is called “linear response.” 
One assumes that currents are proportional to fields. The proportionality constants can be 
evaluated in equilibrium. This method works because one assumes that the applied fields are 
small, and the system is only infinitesimally disturbed from equilibrium. 

A second method of transport theory is discussed in this section. One assumes the 
existence of a distribution function f, which describes the behavior of the particles. One writes 
a differential equation for the motion of f through phase space. The differential equation is a 
Boltzmann equation. One then tries to solve the Boltzmann equation for a system out of 
equilibrium. For fields that are small, the system is only slightly out of equilibrium, and one 
reproduces the linear response solutions described earlier. The advantage of the Boltzmann 
equation method is that one can also try to solve the equation when the system is far from 
equilibrium. 

The original Boltzmann equation described the behavior of a distribution function 
f(v, R, À of three variables: velocity, position, and time. The Wigner distribution function 
(WDF) was introduced in Sec. 3.7. It is equivalent to a distribution function f (k, œ; R, £) with 
four variables: wave vector k, energy œ, position R, and time ¢. This latter distribution 
function is the one needed for many-particle systems. Since f(k, œ; R, £) is not positive 
definite, calling it a “distribution function” is probably misleading. This phrase is used since 
it is widespread, but the warning should be kept in mind. 

The differential equation obeyed by /(k,@;R, 1) is called the quantum Boltzmann 
equation, which is abreviated QBE. It is derived rigorously in the following sections. Here a 
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quick derivation is provided using semiclassical arguments. According to the Liouville 
theorem a distribution function f (q;, t) is ee when it obeys the equation 


of 
of =0= 8.253 
y=0= T+ oF a+ (ZT) (8.253) 
where the last term is from scattering. For the WDF, k = F (force), R = v (velocity), and 
© = v’ F (Joule heating). The QBE is 


This equation has nearly the right form. It is derived rigorously in the next section, which 
produces a few more terms from self-energy contributions. That derivation also provides a 
prescription for obtaining the scattering term. 

Equation (8.254) has one feature that is important. The additional variable œ also causes 
a new driving term on the left of the form v» Fd/dm. This term was first derived by Mahan 
and Hansch (1983). The semiclassical distribution function f(v, R, 7) lacks this driving term 
since it lacks the energy variable œ. The QBE for the WDF is a different equation from the 
traditional Boltzmann equation. 


8.5.1. Derivation of the OBE 


The QBE is the equation of motion for the Green’s function G<. The method of deriving 
transport equations was pioneered by Kadanoff and Baym (1962). Recall from Sec. 3.7 that 
f(r, t; R, T) = —iG<(r, t, R,T), where (r,t) are the relative variables and (R, T) are the 
position and time in center-of-mass. The Green’s function was defined in terms of the field 
operator as 


G<(r, t R,T) =i R- Ar, T -AR +44, T +40) (8.255) 


The next step is to Fourier transform the relative variables (r, t) into (k, œ): 
G<(k, œ; R, T) = [Prem | dte’' G<(r, t; R, T) (8.256) 


The QBE will be derived for a particle in a weak electric field. The intent is to describe 
interacting many-particle systems that have a small current flowing in response to a small 
electric field. The derivation will be sufficiently general to include any kind of particles and 
nearly any kind of interactions. 

The electric field can be introduced as either a scalar or a vector potential. The QBE is 
independent of this choice, as required by gauge invariance. Here both are included, in order 
to provide the most general derivation. There will be an electric field E,, which is from a 
vector potential, and another electric field E, from a scalar potential. The final version of the 
QBE will include only the total electric field E = E, + E,. The scalar potential is introduced 
through the interaction term 


Hy = -eE Dr, (8.257) 
J 


The vector potential is introduced by changing the momentum of each charged particle to 
(p — eA/c), where the vector potential is A = —cE,t. A vector potential proportional to time 
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could occur in a wire loop with a slowly varying magnetic flux through the center. The present 
theory is for a dc electric field. 

Equations (3.331) and (3.332) are equations of motion for the four Green’s functions in 
the 2 x 2 matrix for G. The two electric field terms are added to the left-hand side of these 
equations—they are included in Hy. The derivation of the QBE involves several algebraic 
manipulations on the left-hand side of these equations. In order to avoid a lot of cumbersome 
notation, the right-hand side of these equations are not written during these steps. On the left, 
the steps are the same for all six of the real-time Green’s functions. The generic symbol G 
applies to any one of them. 

The two equations for G in (3.331) and (3.332) are first added and then subtracted: 


ð 0 
ij}——-—]-A, — H, |G = 8.258 
ie m) | 
(242) +H,\6= (8.259) 
At, ðh ed 


where H, = H)(r,, pı) and H) = Ho(r2, —p,). The two equations contain time derivatives 
that relate either to the relative or center-of-mass motion 
ð 0 0 0 ð ð 
— +n, Laa 8.260 
ot, Of ƏT Ot; Oty ot ( ) 
For particles with parabolic band dispersion, the sum and difference of the two Hamiltonians 
produce simple expressions in relative coordinates. As a first step, consider what happens to 
the two momentum terms which are in the form of (p — eA/c) 


p; + eE,t, = (p+ eE,7T)+ i (P + eE,t) (8.261) 
pz — cE, tf, = —(p + cE,T) +4 (P + eE,t) (8.262) 
where p and P are the relative and center-of-mass momentum. It is important to understand all 
of the various plus and minus signs. Since A = —cẸE,t then (p — eA/c) = (p+ eE,t) for p}. 
However, in H, the momentum enters as —p, which explains the sign change on the bottom. 


On the right one uses ft, 2 = T + t/2 and p, 2 = +p + P/2. These results make it easy to see 
the form of H, + H}: 


J l 

Hı +H, =—(p + eE,T) + Œ+ eE? — 2eE,:R (8.263) 
l 

H, — H, = — (p + eE,T) : (P + et) — eE, +r (8.264) 


Divide the top equation by two, and obtain the following two equations for the Green’s 
function: 


0 2 | 2 
2| i— — — E TY —— . = . 
p zP +e T) -P + ek) +eE, R|G (8.265) 
0 1 
izp 7, (Pt CELT) (P + eE,t) + eK, «rr G = (8.266) 


These two equations describe the relative and center-of-mass motion of the function 
G(r, t; R, T). The goal is to derive the QBE for the WDF G<(k, œ; R, 7). Fourier transform 
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the variables (r, 7) to the set (q, Q) as in (8.256). This transform changes p to q, r to iV}, d/dt 
to —iQ, and ¢ to —id/dQ. The two transformed equations are 


2 
fara, R- = (a+, Ty tga (Vet Z) jaaar n- 
(8.267) 


0 
BE (q+eE,7): (v + sE, Š) CSAC Q; R, T) = 


The second of these equations has, on the left-hand side, exactly the same terms as are found 
in the Boltzmann equation. This similarity suggests that the QBE is the same as the BE. 
However, this conclusion is incorrect. The above set of equations have several things wrong 
with them 


1. There is the term eE, R. This term seems to combine with Q to produce a center-of 
mass energy © = Q + eE,-R. Together they suggest that the energy of a particle 
depends upon its location. The energy is different at one end of a sample than at the 
other. However, this behavior is contrary to common sense. When there is a small 
electric field along the sample, and a small current flowing, we expect the system to 
be uniform. There is the same particle density, current density, etc. at each point in the 
solid. There is no dependence upon R. This undesirable term has to be eliminated. 

2. The relative momentum seems to enter in the combination of q + eE,7. It depends 
upon the center-of-mass time 7: This feature is also unphysical, and needs to be 
eliminated. 

3. The result is not gauge invariant, since the two fields E, and E, enter differently. 


All of these problems can be eliminated through a variable transformation. Of course, this 
transformation also causes some derivatives to change: 


QO-+eE,-R>0 
q+ eE,7 > k 


0 
Vr > Vg + eE, (8.268) 
0 0 
3T => 3T + eE,’ Vk 


These transformations cause the two equations in (8.267) for the Green’s function to now have 
the form 


l a? 
o-a + gg (Vet Ba] Jou o; R, T) = 


0 0 
apt Ye Vetek: (vini ) [ea 0; R, T) = 


(8.269) 


The notation on the Green’s function has been changed again. The arguments (q, Q) have 
been changed to (k, œ). The left-hand sides of these two equations are now in the form that is 
useful. The lower equation has exactly the same terms as in (8.254), and is the quantum 
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Boltzmann equation. The electric field is E = E, + E,. Both electric fields contribute in the 
same way, and the result is now gauge invariant. 

The above variable transformation makes the QBE gauge invariant. The results are valid 
for dc electric fields. A more complicated variable transformation is required for ac electric 
fields (Levanda and Fleurov, 1994). 

Now it is time to restore the scattering terms to the right-hand side of the equal sign. The 
scattering terms for G< and G,,, are in (3.341). They are the most important Green’s functions 
in applications using real time. The self-energy terms on the right are found by following the 
same steps to bring the left-hand side to (8.269). In doing these operations, keep in mind that 
the scattering terms are of the form 


| drs 106:2) 


First add and subtract the two equations for each Green’s function. There follows a series of 
variable transformations. They can be understood by examining just one term in the right- 
hand side of Eq. (8.269). The various variable arguments yield a fairly complicated expres- 
sion, which is derived by the following steps: 


1. Change the two sets of variables (x,,x3;)(x3, x) to the center-of-mass grouping 
(x1 — x3, œ +3)/2)(%3 — x2, (%3 + x2)/2). 

2. Change the integration variable from x, to y =x, — x3, so the variable grouping 
become (y, x, — y/2)(xy — xy — y, @ +x2 —y)/2). 

3. Change to center of mass variables x = (x, — x2), X = (x, +x,)/2, which produces 
the variable grouping (y, X + (x — y)/2)(x — y, X — y/2). 

4. Change the integration variable x to z = x — y which produces the final arguments of 
(y,X + 2/2)(z, X — y/2). 


Below are the full scattering equations for G< and G,,,. The explicit variables in the scattering 
terms are only written out in the first term, but are identical for the other three. The order of 
the terms in the scattering integral is important. The first one always has argument 
(y, X + 2/2) while the second one has argument (z, X — y/2). The four-vector in the exponent 
is q = (k, œ): 


1 aN? 
[o — & + 8m (v: + eK =) Jot O; R, T) 
1 , , 
= 145 |e | y Ea XX +2) X — 9/2) 
+ Gie, X + 2/2) Xret(Z, X — y/2)] (8.270) 


| a 0 
ER “Vre+teE: (va + Va) [Gul wo; R, T) 


= | dze" | dye? È et Gret 7 Gret ret (8.27 1) 
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l a\° 
o- Ek +g, (Vs +E) Jo wo; R, T) 


1 , . 
_ 5 | ace | dve"12,G° — EG + G,£< — GÈ] (8.272) 


ð ð 
|; + Vk ° Ver + eE ° (v + wg) [ore QO; R, T) 
0 dM 
= | dze” [peves — EG — G,X< + G<d;] (8.273) 


These four equations are the important ones for nonequilibrium calculations. Although one 
primarily wants to find GS, it is always necessary to first find G,,,. These equations were first 
derived by Fleurov and Kozlov (1978). 

So far no approximations have been made, and the equations are exact. The QBE has a 
linear term in the electric field E. However, the equation is exact to all powers of E, not just to 
the first power. 


8.5.2. Gradient Expansion 


Equations (8.271) are usually too hard to solve because the scattering terms on the right 
have a complicated form. Some sort of approximation has to be introduced to simplify the 
right side. The approximation described below is only valid to first order in the field. 
Kadanoff and Baym (1962) introduced an approximation for evaluating these scattering 
terms, which is called the “gradient expansion.” They assume that the center-of-mass time T 
is very large, and take the limit that T — oo. At large values of T, they assume that the system 
is approaching its asymptotic limit, so that variations with respect to T are small. Obviously 
the gradient expansion is not suitable for studying transients, since it is poor at small values of 
T. Neither is it useful for steady-state ac phenomena (Mahan, 1987). Indeed, the T dependence 
is so poorly described in the gradient approximation that it should not be used and the 7 
derivative terms should be dropped from the QBE. Nevertheless, the gradient expansion is 
used since it is applicable for homogeneous (small R dependence) steady state (small T 
dependence) systems. 

The center-of-mass variables are all in the form (R + AR, T + AT), which are expanded 
in a Taylor series about the point X = (R, T). The integrals can be done for each term in the 
series. These integrals usually cause further derivatives. The gradient expansion is shown 
below for a typical scattering term: 


[= | dze? |pewzo. X +2/2)G(z,X — y/2) (8.274) 


= | dze? l dye? L(y, X)G(z, X) 


y 0G(z, X) n 
2 dx 


G(z, X) — X(y, X) 


(8.275) 


= X(q, X)G(q, X) + ; [(V,2)VyG — (VyZ)V,G] ++: (8.276) 
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The first term on the right are usually the largest. It is customary to retain the first derivative 
terms and ignore higher derivatives. They can be expressed using a Poisson bracket notation: 


[Z, G] = (V,Z)VyG — (VyX)V,G (8.277) 
_ d2dG IGAL 
CL > . . . 
QIT QAT Vg" VrG + VaC' Vrè (8.278) 


These frequency derivatives are with respect to the Q variables, since the variable change 
(k = q + eE,T, œ = Q + e£,- R) has not yet been made. If this step is taken now, the 
derivatives get altered according to Eq. (8.268), which changes the Poisson brackets to 


[C, D] > [C, D] + cE: (5) VD — (2 ) vc] (8.279) 


where E is again the total electric field. This analysis finally derives from (8.271) the 
following expression for the nonequilibrium retarded Green’s function in an electric field, 
when G,« and Èe depend upon (k, œ; R, T): 


] aV 
o —& + 8m (v +eE =) -zu G,(k, ©; R, T)=1 (8.280) 


0 0m 
di t Vy Vr + eE: (1 — 9 Za)y, + (Vk + VX ret) mals ret — i[Z vet: Gretl» (8.281) 


The additional terms from the Poisson bracket, which are linear in the field E, have been 
transferred to the left of the equal sign. 

These equations simplify for nonequilibrium systems which are both homogeneous 
(Vr = 0) and steady state (0/dT = 0). The Poisson brackets vanish, as do several terms on 
the left. Also ignored are terms nonlinear in the electric field O(E?), and find for the above 
two equations 


[o — Ek — Zret] Cret =1 (8.282) 


ax ð 
ick (1 - > =") Vi + (Vk + V2 ret) = Ga = 0 (8.283) 


The first equation is easily solved, to yield 


Ge (K, ©) = + O(E’) (8.284) 


I 
© — & — Dret(K, o) 


The retarded Green’s function appears to have no first-order term in the electric field. It 
actually does, since the self-energy ~,., has a term linear in the field due to the electron- 
phonon interaction. This term is small and seems to have little effect. If it is ignored then the 
retarded Green’s function is unchanged from its value in equilibrium. This result considerably 
simplifies the solution to the QBE. The solution (8.284) also satisfies the equation (8.283). 
Related quantities such as the advanced function G,,, = G3, and the spectral function 
A(k, œ) are also unchanged to first order in the electric field. This completes the discussion of 
the retarded Green’s function in a static electric field. 
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Next the gradiant expansion is applied to (8.271) for the equations for G<. Again 
keeping only first-order derivatives, the final results are the quantum Boltzmann equation 


(QBE): 


l a\° 
o — & +> (v + eE Z) |e = G* Re[X ret] + 2 ret Re[G,et] 


8m IO 
ios ae Ios nm iÍ ) a) ` 
+72 ,G l-73 ,G |+7Œ (= D 
3G> aG 
+( a he | (8.285) 


{a ax, a1) _- 
apt MeV + eK: (1 -F2)M + (Vk +WD |} 


° dz OG < > < < > 
—ieK: A Vp RelG,] — ay Vk | = 2° G* —X°G 


+ i[Re[Z,], G<] + iE“, Re[G,«]] (8.286) 


Standard relationships have been used in deriving this equation, such as G, — G; = 2 Re{G,,,] 
and G, + G;=G<+G. 

Equation (8.286) is the quantum Boltzmann equation. It is rather formidable. It is also 
difficult to solve, since it is usually an integral equation. Sometimes it is nonlinear in the 
particle density because the self-energy functions £< and =~ are also functions of G< and 
G. 

The QBE also contains the functions G7 and &~. Similar equations for these functions 
can be derived by starting from the general equations (3.331) and (3.332). This derivation 
shows that the equation for G7 is almost identical to the one for G<. In fact, one can prove 
that the following identities are valid: 


G? = G5 — iA (8.287) 
E> = E< + 2i In{Y,,,] = E< — 2 (8.288) 


These relations are trivial to show for equilibrium, but they are also valid for nonequilibrium 
situations. These identities will be used often to simplify expressions. For example, the main 
scattering term in the QBE can be immediately simplified to 


EGS — EG = -ifr G“ — ESA} (8.289) 


This result will be employed in the calculations. The quantities G< and X~ are generally 
proportional to the density of particles, while retarded functions are only indirectly dependent 
upon the density of particles—only through the self-energy function È,«. In the QBE, each 
term has one factor that is either G~ or X<, so each term is proportional to the density of 
particles. This equation does have the character of a transport equation. 
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The QBE simplifies for the treatment of systems that are homogeneous (Vp = 0) and 
steady (0/07 = 0). These derivatives are dropped, as well as the Poisson brackets, since the 
latter contain similar derivatives. In this case the QBE is 


0m ð 
eE. (1 — Za)y, + (Vk + Vice ret) Z G* 


= [az aG ` ` ` 
— ieẸ- B V, RelGreil — 7 Viz | = E*A -rG (8.290) 


This equation can be simplified. The QBE is only valid to first power in the electric field, since 
terms of O(E°) have systematically been ignored. For example, in the gradient expansion the 
second derivative terms would give contributions of O(E”) were they retained. So the 
equation is exact to first order in the electric field, but is not exact to higher orders in the field. 
This fact is utilized to simplify the left-hand side of the equation. Since the field multiplies 
each term, on this side of the equal sign the Green’s functions can be taken to have E = 0. Of 
course, these expressions are just the equilibrium quantities in Sec. 3.7. 

Consider the frequency derivatives of the left-hand side of the equation. Write the 
spectral function in the shorthand notation 


— 27 
92 +7?’ 
The left-hand side will have three types of frequency derivatives: dn;/do, Əl /3w, 00/00. 


The coefficients of the latter two vanish, which leaves only terms in 0n,;/dq. All of the terms 
proportional to np vanish, leaving only terms multiplied by anp /30: 


Tr = — Im[È etl, o = 0 — & — RelÈ etl (8.291) 


ð 
A(k, ©) SE eE ` {vi + Vy Rela DI + oV,r} = E> G< — E<G (8.292) 


The factor of A(k, @)* appears in each term, and was taken outside. The left-hand side of this 
equation now contains only known quantities, which can be calculated in equilibrium. The 
scattering terms remain on the right-hand side. Finding them still involves work, usually in the 
form of an integral equation. This final form for the QBE is exact for transport which is linear 
in the field, and for steady state, homogeneous systems. It is quite analogous to the similar 
expression for the classical BE, which is 


of ® 
—eE: Via = X7 G* — X*G (8.293) 


The classical equation has df /da, while (8.292) has dn,;/d. 

Equation (8.292) is the steady state, homogeneous form of the QBE. It should be the 
starting point for many transport calculations. It is exact, and is an alternative to using the 
Kubo formalism, which is also exact. The derivation of this equation has been complicated, 
and has entailed some work. However, once derived, it is often the easiest starting point for 
deriving the transport coefficients. Calculations using Eq. (8.292) entail less work in getting 
to the answer than any other formalism. 


8.5.3. Electron Scattering by Impurities 


The quantum Boltzmann equation (QBE) (8.292) will be solved for the electron scat- 
tering by impurities. This case was solved in Sec. 8.1 using the Kubo formula. This exercise is 
useful for two reasons: (1) producing the known result demonstrates the correctness of the 
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QBE; and (2) this case is the easiest one to solve, and provides an introduction to the 
techniques for solving the QBE. The present example makes two assumptions that are 
intended to make the solution as easy as possible: (1) the impurities are dilute, so that the 
simultaneous scattering from several impurities can be neglected; and (2) the impurities have 
no internal degrees of freedom, such as spin or vibrations, which can be altered by the 
electron scattering. The second assumption implies that impurity is a simple potential that 
elastically scatters the electron. The impurities are randomly located in the solid. The method 
of Sec. 4.1.5 is employed for averaging over the random distributions. 

The first step in solving the QBE is to find the retarded functions. For impurity scattering 
they are independent of the electric field, at least for small fields. The retarded self-energy for 
scattering from impurities of density n; is given in Sec. 4.1 


Zelk, @) = 1; Ti, (@) (8.294) 


The T matrix is energy dependent, and hence off-shell. 

The next step is to express the self-energies £< and X7 in terms of the Green’s functions 
G< and G~. The retarded functions G,,, and ÈX e are known, since they are unchanged by the 
electric field. However, G< and G7 are, at this point, unknown since they are affected by the 
electric field. The self-energy functions are 


3 
B=", 0) =m | 2 TOO, 0) (8.295) 


This result is derived below. The off-diagonal T matrix is the one in Eq. (4.111), which also 
depends upon the energy œ. 

Equation (8.295) is now derived. An impurity at R = 0 is represented by an electron 
potential V(r), whose Fourier transform is V(q). The self-energy È from a single scattering 
event 1s 


Zox, x2) = V(r )8'( — X2)1 (8.296) 


where T is the unit tensor. This self-energy is inserted into (2.157). That equation is iterated in 
order to find the effects of repeated scattering from the same impurity. The resulting self- 
energy series is rewritten in a symbolic notation, where the product of two functions implies 
an integral over dx. Iteration of these equations gives the series for ©: 


È = Lolf + Goto + GoXy Gory + °°] (8.297) 
After summing this series, the matrix È is examined for its individual components. The ones 
for &*'* are 


yo = [1 + Giet>ret]20 [1 + 2 adv Cady] + Det 7 Lady (8.298) 


Note the analogy with the equation for G< in (2.159). The resemblance is expected, since the 
series for 2 has the same mathematical structure as the one for G. 

For impurity scattering, the unperturbed self-energies £67 = 0 are zero. There is only 
the last term in (8.298). Since the self-energies Xe and Z,4, are T matrices, (8.298) gives the 
result in (8.295). The damping function I (k, œ) and other self-energy £7 (k, œ) are 


3 
21 (k, ©) = n; s ITs A(p, @) (8.299) 


E> = 5< — 2T (8.300) 
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For dilute impurities, it is sufficient to retain only those terms that are first order in the 
impurity concentration n;. The broadening due to the spectral function A(p, œ) is from 
impurity scattering. Since the self-energy is multiplied by n, one can replace the spectral 
function A(p, œ) + 2n5(@ — £p). This expression is then equal to the imaginary part of the T 
matrix: 2I = n;v,07, where 6, is the total cross section from impurity scattering in Sec. 4.1. 

The starting point for solving the QBE is (8.292). On the left of the equals sign is the 
factor of 


[(Vk + Vj Re[Lre]V + oV, Mík, 0)? (8.301) 


This expression is simplified by neglecting terms of O(n;) such as I’ Re[%,,,]. The term in 
Oo = © — & — Re[È et] is small since the factor of A(k, œ tends to force o ~ 0. The above 
expressions can be approximated by v,IA(k, o): 


A(k, op CeO) 


eE. vI = 2il G5 —ix*A (8.302) 
These steps complete the derivation of the left-hand side of the QBE. 

On the right side of (8.302) the scattering terms vanish in equilibrium since 2~ = 2in-T 
and G< = in,-A. Since current is flowing in response to the field, the system is slightly out of 
equilibrium. The right-hand side is expected to have factors similar to those on the left. This 
discussion suggests the following ansatz for the nonequilibrium Green’s function 


G~ = iA(k, œ) nolo) — (=) eE -v, A(k, o) (8.303) 
The function A(k, œ) is unknown, and needs to be determined by solving the QBE. The 


factors that multiply A are for later convenience. The above choice does not make any 
assumptions for the value of A. Using this ansatsz in the self-energy function gives 


z5 = in |E Talao oiro- (ek -v ACP 0) 


d 
= 2il np — in; SEn, 2 IT xp! *A(p, o)eE+v pA(p, ©) (8.304) 


(27) 


Putting these two expressions into the right-hand side of (8.302), the equilibrium terms 
cancel, and the remaining terms each have the common factor of (dn-/do)A: 


d 
A? MeO) og ‘uy, = - (a acres “vA 


dp 
-n | 5% 5 ITip|’A(p, @)eE + v,A(p, @)} (8.305) 


After canceling all of the common factors, there is an integral equation for the unknown 
function A(k, œ): 


v,A(k, ©) = 5A(k, @)v, + Tip A(p, @)v,A(p, ©) (8.306) 


we | Se 


This integral equation for A(k, œ) is nearly identical to the one found in solving the 
conductivity from the Kubo relation. The quantity v,A(k, œ) is similar to the factor of 
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I'(k, œ — ið, œ + ið) in Sec. 8.1.3. As the derivation proceeds, the differences between these 
quantities disappears, and the two approaches give the same resistivity. 

After solving this equation for A(k, œ), the Green’s function G< in (8.303) is known. It 
is used in (3.340) for the current. The first term in (8.303) for G< gives a zero current. The 
second term gives a current proportional to the field E, and this proportionality defines the 
electrical conductivity 


dp [(® d d 
E A O O E 


The two equations (8.306) and (8.307) provide the solution for the conductivity from the 
QBE. 

For electrons in metals, with a spherical Fermi surface, the conductivity from impurity 
scattering is found easily from these equations. First, do the integral over dœ in (8.307). At 
low temperature the factor (—dn;/dq@) is nearly a delta function which sets œ ~ 0. At low 
temperature 


3 
yy = a [oe P” on’pv (Dp, 0)A(p, 0) (8.308) 


The equation is reduced as far as possible. Next one must solve the equation (8.306) for A. 
Adopt a vector coordinate system where the Z direction is k. Then the various scalar products 
of vectors can be found from the law of cosines: 


k- Ê= cos(8,), ke p = cos(8) (8.309) 
p+ E = cos(0,) cos(0) + sin(@) sin(6,) cos() (8.310) 


Equation (8.306) is multiplied by the vector E. The integral over the do part of d?p makes the 
cos(®) term vanish. Each of the remaining terms has a factor of cos(®)) which can be 
canceled. Then one finds the scalar equation 


2 
A(k, 0) = 4A(k, 0) + r2 5 |Tkpl4(p, 0)A(p, 0) cos(8) (8.311) 
The spectral functions A(k,0) and A(p,0) force k = p = kp. The factor of A(p, 0) under the 
integral can be set equal to A(kp, 0) and taken out of the integral. Then the above equation 


can be solved to find 
A(kp, 0) = a Ake, 0) (8.312) 


where t = 1/2I is the time between scattering from impurities, while q, is the lifetime which 
is important for resistivity; it has the factor of (1 — cos 0^: 


2 
_ net (8.313) 


m 
1 
a ns [oe $ [Tip Ap, w){1 — cos(8)] (8.314) 


This formula for the resistivity is identical to (8.25). The QBE gives the same formula for the 
resistivity as found earlier from the Kubo formula. The two methods are also in exact 
agreement for much more complex cases, such as electron-phonon scattering at nonzero 
temperatures and frequency. Most of the steps in the derivation were spent in getting to 
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(8.302). This equation is the starting point for any calculation for homogeneous systems. The 
number of steps between this equation and the final resistivity is small. The QBE is an 
efficient method of finding the resistivity or other transport coefficients. 


8.6. QUANTUM DOT TUNNELING 


8.6.1. Electron Tunneling 


Cohen et al. (1962) introduced the concept of the tunneling Hamiltonian, which has 
become universally adopted for the discussion of tunneling. Their idea was to write the 
Hamiltonian as three terms: 


Hr = E Tip Chore + he) (8.316) 
po 


The first term Hp is the Hamiltonian for particles on the right side of the tunneling junction. It 
contains all many-body interactions. Similarly, H; has all the physics for particles on the left 
side of the junction. These two are considered to be strictly independent. Not only do these 
two operators commute, [H,, Hg] = 0, but they commute term by term. The Hamiltonian on 
the right can be expressed in terms of one set of operators Cko and those on the left by another 
set Cpo» and these operators are independent {C,q, Cho} = 0. This assumption is probably 
reasonable. They further assumed that the tunneling is caused by the term H7 in (8.316). The 
tunneling matrix element 7, can transfer particles through an insulating junction. This 
transfer rate is assumed to depend only on the wave vectors on the two sides k and p and not 
on other variables, such as the energy or spin of the particles. 

The theory of electron tunneling was mainly applied to superconductors. It developed 
very rapidly and was entirely based on the tunneling Hamiltonian. The theory showed 
excellent agreement with the many experiments. The history books were written describing 
this satisfactory situation, and the scientists in this field wandered off to do something else. 
About this time there began a serious investigation, starting with Zawadowski (1967), Caroli 
et al. (1975), and Feuchtwang (1975), about the validity of the tunneling Hamiltonian. Of 
course it was found to be a poor approximation, since the tunneling rate depends on the 
energy of the particle as well as its wave vector. What does this turn of events do to the lovely 
agreement between theory and experiment for tunneling in superconductors? Actually, it 
probably changes none of it. The tunneling in superconductors takes place over a very narrow 
span of energies in the metal, 1.e., within a Debye energy of the Fermi surface. Also, all the 
electrons involved have their wave vector very near the Fermi wave vectors kp and ppr on the 
two sides of the junction. It is an adequate approximation to treat the transfer rate 7, as a 
constant Tọ which is evaluated at k and pp, because the variations in Tkp with energy must be 
on the scale E/E, which are negligible for E ~ A ~ 1 meV. Similarly, the variation of Tkp 
with p or k is on the scale of the Fermi wave vectors. One can treat the transfer rate Ty as a 
constant if the energies involved are small. The tunneling Hamiltonian is believed to be an 
improper formalism only when the applied voltages are large, say 1 eV. 

For quantum dots the energy scales are also rather small, and are near to the chemical 
potentials of the metal electrodes. Here the tunneling Hamiltonian is also thought to be a valid 
formalism. 
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The general model of a tunneling junction is shown in Fig. 8.12. It describes a non- 
equilibrium situation, since the chemical potential on the left-hand side u; is not the same as 
upg on the right. They differ by the applied voltage eV = u; — ug. The potential drop of eV 
occurs in the insulating region between the electrodes. 

The tunneling Hamiltonian (8.316) is used to derive a correlation function for electron 
tunneling currents. This correlation function has the form of a Kubo formula, except for an 
important difference. The Kubo formula for the conductivity in Sec. 3.7 expresses the ratio 
between the current and the voltage (actually electric field). In tunneling theory, the corre- 
lation function gives the current as a function of voltage. 

The tunneling current through the insulating region is expressed as the rate of change of 
the number of particles on, for example, the left-hand side of the junction N,. This rate is 
found from the commutator of N; = >> Che Cpo With the tunneling Hamiltonian. Only the 
term Hy fails to commute with N}, 


bo — Tkp Cho Cke! (8.317) 


N, = iH, N;] = iH, N;] = i X [Tip Cho 
kpo 
The total current J through the tunneling interface is defined as the average value of this 
operator: 


I(t) = —e(N,(t)) (8.318) 


The average value of (N,(t)) is obtained by following the same steps used to derive the Kubo 
formula in Sec. 3.8. The total Hamiltonian is written as H = H’ + Hr, where H’ = Hp + H. 
Go to the interaction representation, where the tunneling term Hp is treated as the interaction 
and everything else H’ is Hy. Then the S matrix is expanded in terms of the perturbation Hy. 
The objective is to obtain a formula where J « |T kpl”, so only the first term needs to be 
retained in the expansion of the S matrix. These steps bring us to the formula 


I(t) = —ei| dt (NL, Hr(t’)]) (8.319) 
H(t’) = e” " Hype" (8.320) 
N,(t) = HN e”! (8.321) 


where the time dependence of H7(t) and N z(t) is governed by H”. 

An important step in the calculation is to insert the chemical potentials u; and upg for the 
two sides of the junction. This insertion must be done with more care than usual, because the 
chemical potential is not the same on the two sides of the system. The initial Hamiltonian 
(8.316) has been written to not include the chemical potentials, so the energy is measured on 


FIGURE 8.12 Tunneling between two normal metals. The arrow shows the electron path through the oxide interface. 
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an absolute scale rather than relative to the chemical potentials. However, now insert the 
chemical potentials into the time developments, so that the energy can be measured with 
respect to the different chemical potentials on each side of the tunnel junction. The symbols 
Kp and K; denote the Hamiltonian with respect to the respective chemical potentials: 


Kp = Ap — UpNp (8.322) 
K; — H; — HUN; (8.323) 


For a free-particle system, Kp = ) E.Cl Cko; while Hp = >> MOR since Ëg = Ek — HR- 
Since the number operators commute with H’, it is possible to write H’ = K’ + RNg + UN; 
and exp(iH’t) = exp(ikK’t) exp[it(u,N; + UpNp)| since the exponentials can be separated 
when the operators commute. The time development of Hp is 


rq ayy ler: o; agi 
H,(t) — eH ‘Hre iH't __ eK [e CNi tN) He ith NRHN) Je itK 


— Pigi L | Tipe CoCo + elt HT Cho Cro |e 
po 


The commutator of Hr with the number operators produces the factor u; — Hyp = eV, which 
is identified as the applied voltage, as in Fig. 8.12. The applied voltage appears in the 
correlation function. The correlation function will now be evaluated by assuming that both 
sides of the junction are in separate thermodynamic equilibrium. 

The current operator in (8.319) now becomes 


t 


I(t) = el 


—& 


ar( E ITeC — Tipe” COCO] 
po 


» [Tepe Quy f) Cpe) + RO) 
k'p’o’ 


From now on the time development of Cpo operators is governed by C,,(¢) = eKr Cy eRe! 
and Cpo operators by C,,(t) = e*!'C,,e~*"". Define the operator A as 


ACD = X Tap Chal Cpl (8.325) 
po 


and the current is written as the summation of two currents 


I;(t) = e| dt @(t — tye” (A(t), AEN — VO AO, A} (8.327) 


L(t) = e| d'Olt — eV +9 AA, ACN) — TAA, AE (8.328) 


—00 


The term Zç is for single-particle tunneling, which is important for quantum dots. The other 
term J, describes the tunneling currents associated with the Josephson effect in super- 
conductors. It is evaluated in Chapter 10. 
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The terms in J, have just the right combination of factors to be a retarded Green’s 
function. The integrand depends only on the difference of(t — t’), so set ¢’ = 0: 


Ual) = iOH, A" (0)]) (8.329) 
U-e(—eV) = D dte U (O) (8.330) 


The second term in J, is the Hermitian conjugate of U,«(—eV) except for a sign. But the 
Hermitian conjugate of the retarded function is just the advanced function. The single-particle 
tunneling current is written as; 


Iş = ie[Use(—eV) — Usay(—eV)) (8.331) 
= —2e Im[U,,,(—eV)] (8.332) 


It is twice the imaginary part of a retarded correlation function, which has the form of a 
spectral density function. Using the relationship between the Matsubara and the retarded 
correlation functions discussed in Sec. 3.3, the way to calculate the single-particle tunneling is 
to evaluate in the Matsubara formalism the correlation function 


B 
U(iw) = -| dre™®™" (T_4(t)A'(0)) 
0 


bp 
p -È 2 Nip Typ’ | dre!" (T, Cho (1)Cpol1)C} o Cro) (8.333) 
po k’p'o’ 


The single-particle tunneling current is just the spectral function of this operator evaluated at 
the real frequency —eV /h, as shown in (8.330). The Matsubara frequency @, = 2n7/B is 
boson, since the correlation function has pairs of fermion operators. In the tunneling 
Hamiltonian, the nght- and left-hand sides of the tunneling junction are independent. Then 
the correlation function factors into a product of the Green’s functions for the right and left 
sides of the junction: 


BO 
Ulio) = X Tl] dre!" (T.Cyo(0)Chg(t)) (T; Cpo(1)C} (0) 
po 


B 
7 Žž Tebl dre™'Gpg(k, —1)G,(p, 1) 
po 


l o, , 
= x Tip!” = Fak, ip — 10)G,(p, ip) (8.334) 
po ip 


The only terms which enter are those for k = kK’, p = p and o =’. A type of Feynman 
diagram is shown in Fig. 8.13. The symbols T in circles are the tunneling vertices. The 
vertical dashed line is meant to divide the right from the left side of the junction. The solid 
lines are the interacting Green’s functions Yp and G,. 

The correlation function is the product of the two Green’s functions for the two sides of 
the junction. Such a simple result is obtained only when one can neglect the vertex diagrams 
of the correlation function. The argument for neglecting them in tunneling is that vertex 
corrections require some interaction between electrons on the two sides of the junction, which 
is improbable. Actually, there is another case where vertex corrections are needed in 
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° 


FIGURE 8.13 Feynman diagram for tunneling, where the 7s are the vertices which link the nght (R) and left (L) 
sides of the junction. 


tunneling, and that is to account for processes where the tunneling electron can excite 
vibrations or other excitations in the interface. That happens in quantum dot tunneling. 

The first example of solving (8.334) will be for the tunneling between two normal 
metals. The experiments show the tunneling is linear in voltage V for small values of V and 
becomes slowly nonlinear at larger values of V The latter behavior is a failure of the tunneling 
Hamiltonian, which predicts only linear behavior. The present discussion is limited to small 
voltages, for which the tunneling Hamiltonian formalism is valid. 

For a normal system, consider the electrons to be simple quasiparticles, and approximate 
the Green’s functions by go and gv., The Matsubara summation is then familiar: 


l Or onana a oa — MFG) = np (Ëp) 
p> (p, ip)G, (k, ip — iœ) = -oE E E E (8.335) 
2 Mr(S) — nr (6p) (8.336) 


Ulio) = 2 Ip io + Ék — Ep 


I = —2e Im[U,.,(eV)] 
= 4re 2 Tip! nrk) = np (&p)lòleV + Ék — Sp) 
p 


The energies €, and occupation factor np(&x) refer to the right side of the junction, and ¢, and 
n-(§,) refer to the left side. An additional factor of two in front is for the summation over 
spins. The spin is usually preserved in tunneling. The summations over k and p are just over 
wave vectors. For small voltages, only electrons very near to the Fermi energy are involved in 
tunneling. For these small energies, the density of states on both sides is assumed to be a 
constant: 


dk 

polao | s337 
dp 

D> | cos Mi | dë, (8.338) 


I = 4neN,N, |T} l dë, | declare — np(&)I(CV + Er — &) 


= 4neNgN ITP | drin En) -nr Gr + eV (8.339) 
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At zero temperature, the occupation numbers are step functions, so (8.339) becomes 


0 


I = aneNgN,ITP| dep = CoV (8.340) 
V 


—e 


Oy = 4me7NRN, |T|? (8.341) 


The tunneling device behaves as a simple resistor, with a conductance (inverse resistance) 
given by 6p. With a little more care, one can show that the integral in (8.339) also equals eV at 
nonzero temperatures, since the thermal smearing cancels between np(&) and n-(& + eV). 
The result is temperature independent, at least for a range of low temperatures kpT < Er. 

A formal expression for the tunneling current can also be derived when interacting 
Green’s functions p, Y, are retained in the Matsubara summation. This type of summation 
was evaluated in Sec. 7.1 using the Lehmann representation 


1 
X (iœ) = qo F(p, ip)G p(k, ip — iœ) 
Ip 


deg deg 
- (ae. ez) [S24 ep)S (8.342) 


l l l _ nr (Er) — nr (€r) 
B ip ip — Ez ip — iW — Ep id) + Ep — Ez 


(8.343) 


The retarded function is obtained by the analytical continuation ia — eV + id and then take 
the imaginary part. 
dé 


R AL(p, Er + eV)Ag(K, Eg)[Np(Eg) — np (Er + eV )] 


-2 malen = |E 


These steps bring us to the formula for the tunneling current of Schrieffer et al. (1963): 


de 
I = 2e 2, Tip? (Zaue Er + eV )AR(K, Ep)lnr (Er) — nr (Er + eV)! (8.344) 
p 


Again a factor of two is added for the spin summation. The tunneling current is expressed in 
terms of the spectral functions on the two sides of the junction. Equation (8.344) is the exact 
formula for J within the model of the tunneling Hamiltonian. The earlier expression (8.339) 
is recovered with the free-quasiparticle approximation Ap = 2nd(€p —&,) and A; = 
2nd(e, — Ép). The virtue of (8.344) is that it contains all many-body effects on the two sides 
of the junction. Its drawback is that it is based on the tunneling Hamiltonian formalism. 

If the tunneling matrix element Tkp can be approximated as a constant, then the 
summations over wave vectors give the interacting density of states 


5 Ar(k, Ep) => 2TnNpR(ER) (8.345) 
k 
> ALP, €L) = 27N; (Ez) | (8.346) 
p 
I = 4nelT|’ | aenn, +eV)inp(£)— npe +erv)] 


which is a useful formula for describing many-body effects in tunneling which occur on a 
small energy scale near to the Fermi surface. 
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A useful formula for quantum dots is to calculate the rate of electron tunneling in each 
direction: left-to-right (“LR”) and right-to-left (“RL”): 


de 
Íg, = 2e 3 |Tkpl" (Bao Er + eV )AR(K, Er)nr (Ep1 — nr (Er + eV) 
p 


V 
~ Ane|T|?NpN, | aenn —nr(e+eV)] = — (8.347) 
de 
Tip = 2e 3 Tp |” [Bao Er + eV )AR(K, Eg)nr (Er + eV)[1 — np(Ep)] 
p 
> OoV 
~ 4ne|T| Np N; denp(E + eV)[1 — np(£)] = Br] (8.348) 


Íri — Irig = OoV (8.349) 


The previous result is obtained by subtracting the currents in the two directions. 


8.6.2. Quantum Dots 


Figure 8.14 shows the typical geometry with a quantum dot (QD). There are two metal 
electrodes on the right and left, and a spherical metal particle between them. The sphere is 
imbedded in an insulating material through which the electron can tunnel. The tunneling of 
electrons from the right to the left electrode 1s facilitated by the QD. The tunneling becomes a 
two-step process: the electron tunnels from one electrode to the QD, and then tunnels from the 
QD to the other electrode. There is also coherent tunneling from one electrode to the other 
through the states of the QD, which act as virtual intermediate states. The first experiments of 
this type were done by Giaever and Zeller (1968), and they have become popular recently. 
Now the QD and electrodes might be composed of semiconductors, where the nanostructure 
is made by lithography. 

First examine the properties of the QD. Assume it is a sphere of radius a in atomic units: 
the dimensional radius R = aay where ay = 0.05292 nm is the Bohr radius. The volume of 
the sphere is Vy = 4nR>/3. The number of electrons is N =njV), where the density 


FIGURE 8.14 Geometry for quantum dot tunneling. 
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No = k}./3n* is assumed to be the same as in the bulk metal. The density of states per spin is 
Np = mk; /(2n7h*). The quantity Np V, has the units of inverse energy: it is the number of 
electrons per unit energy. Conversely, its inverse is the average separation between energy 
levels on the QD 


1 E 


AE = —— = 3r ——_~> — 8.350 
Nr Vo n (krag)a? l ) 
e E 
U = eV, = — = 8.351 
ere OR a (8.351) 


The quantity U is called the charging energy. It is the energy required to add a charge of e to a 
neutral metal sphere. All of the charge collects on the surface, and the energy is just 
U = e*/2R. Table 8.3 compares these two energies as a function of sphere radius. The 
energies are given in terms of electron volts. The two energy scales are interesting. The 
charging energy is large compared to the thermal energy kgT, regardless of which temperature 
is used. If n is the number of electrons on the QD for it to be charge neutral, then the energy 
for n electrons on the QD is 


E, = AE(n— ñ) + U(n— ny’ (8.352) 


The system behaves as an Anderson model, in that the energy of the QD depends quad- 
ratically upon the total number of electrons n. The energy is for a many-electron system. The 
energy U is due to electron—electron interactions. A many-electron description is required for 
treating the QD. 

The other energy AF is also interesting. It is much smaller than the charging energy for 
typical systems with N ~ 10-1000 electrons. Here the relevent question is whether AF is 
greater or smaller than the thermal energy kgT. Of course, this comparison depends upon the 
temperature of the experiment: whether it is at 300K, 1K, or 1mK. If BAZ < 1 then the 
energy levels on the QD can be treated as a continuum, and no error is made by replacing the 
summation over states by a continuous integral. However, if BAE > 1, the states are well 
separated in energy compared to kgT. The summation over states must remain a discrete 
summation. Either condition could apply, depending upon the size of the QD and the 
temperature of the experiment. Different theories in the literature assume one case or the 
other. Some simple statistical calculations are presented of the fluctuation of the number of 


Table 8.3 Number of electrons N, charging 
energy U, and energy level separation AE of a 
quantum dot of radius a in atomic units. 
Energies in units of electron volts. A value of 
r, = 3.96 (sodium) is assumed 


a N U AE 
5 2 2.72 2.116 
10 16 1.36 0.265 
15 54 0.91 0.078 
20 129 0.68 0.033 
25 252 0.54 0.017 
30 435 0.45 0.010 
35 690 0.39 0.006 


40 1030 0.34 0.004 
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electrons on quantum dots. These fluctuations are important for controlling, or else not 
controlling, the flow of current onto the quantum dot. These fluctuations play a role in the 
phenomena called “Coulomb Blockade”. The two different formulas in the literature for the 
tunneling rate are obtain as two limits of the fluctuations. 

The statistical averaging over the states n of the quantum dot includes both the single- 
electron states (n — A)AE and the charging energy U(n— A}. Sometimes they can be 
decoupled. There are, in fact, many configurations of one-electron energy states. Denote their 
quantum numbers by a, so that the energy without the charging energy is E,. 

Calculate the partition function for the electrons on the quantum dot. Assume there are a 
number of single-electron energy levels with energy E, as well as a charging energy 
U(n — ny . Let n, be zero or one, and it denotes whether an electron is, or is not, in the state «. 
Then the total energy of the system is 


n=n, (8.353) 


E=Y-n,E, + U(n— ny (8.354) 


The label « includes the spin and other quantum numbers. Many combinations of states may 
have the same value of n. In this case, the partition function can be written exactly as 


21 
_ 0. l , 
Z — Soe BU) | 2 on, M (8.355) 
21 
2 (7 dO 
= re BUCA | = "S (O) (8.356) 
S(®) = IL, [1 + e PA: -*] (8.357) 


For 8 = 0 then S(0) is just the usual Fermi—Dirac distribution function. The integral over 0 
selects out the terms with a certain value of n. If U = 0 then the summation over n gives a 
delta function at 8 = 0 and the partition function reverts to being a usual product of Fermi- 
Dirac distributions for noninteracting electrons. 

The role of the fluctuations is determined by the function S(8). This function is evaluated 
by breaking the summation over « into two parts depending upon whether €, = E, — pu is 
positive or negative, i.e., whether exp(—B6é,) is less than or larger than one. 


In(S(8)) = F Infe~®(e® +e] + TT In ja + efy(1 + ie) 


Zo So 1 + e BS. 
= In[S(0)] — i071 + 8S(0) (8.358) 
i= 1 (8.359) 
Gx <0 
Ny = z = np (6g) (8.360) 
SS(0) = 3 Inf + (1 — ñ,)(e® — 1)] + X m + ñ,(e™® — 1)] (8.361) 


led ot 


where n is the number of electrons below the chemical potential, and n, is the Fermi—Dirac 
distribution of electrons for the state a. 
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First evaluate òS for the case that the energy separation between the states of energy E, 
is less than the thermal energy kgT = 1/B. Then the summation over « can be replaced by a 
continuous integral. Let N-(E)Vo = Pp(6), Pp(0) = 1/AE be the density of energy states on 
the dot. Then one gets that 


E = Vo | Neus = | depo (8.362) 
85 =|) depo @ inl Hne? = DIH | aero Cnt Hae- 1} 8363) 
ôS = — I afl — cos(0) (8.364) 
5S x -aii — cos(0)] (8.365) 

y = kT pp(0)In2 = “8 In(2) (8.366) 


The integral has been approximated by expanding the log function in a Taylor series and 
keeping the first term. Higher terms (7 > 1) in the series for 6S are much smaller: the series 
converges rapidly. In general y > 1 means that kT > AE. 

With this approximation the partition function can be expressed in closed form 


Zrp = H,[1 + e P&] (8.368) 
Zo = TH Neh” (8.369) 
Vv 
V=n—n (8.370) 
2n 
I (Y) — | dÒ _ive-2y(1-cos() (8.371) 
0 21 
I 
=e] (2y) © me (8.372) 


This formula is an important result. It shows that the Fermi—Dirac statistics of the states 
labeled « are decoupled from the charging of the QD, which is given by the last term Zc. 
When y > 1, then for small values of v the factor of ¥,(y) is independent of v, and the 
averaging over the value of n involves only the term exp(—BUv’). This result is assumed by 
most theoretical papers. The charging is treated as a separate statistical process from the 
Fermi—Dirac statistics of the electrons on the QD. 
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The other case is to evaluate 6.$(0) while assuming that the separation between energy 
levels is greater than the thermal energy. In that case this function 1s evaluated as 


SS ~ —y<(1 — e®) — y? (1 — e™®) (8.373) 
y“ = 2 np(—6a) (8.374) 
y7 = 2 nr(§,) (8.375) 

l >\ v/2 
gs (L) LOT (8.376) 
Z = ZppZe (8.377) 
Ze = yi g e PUY (8.378) 


Here y < 1. The formulas for the two cases become identical when y = y‘ = y7. 

However, if y < 1 then Y,, does depend upon y and this factor should be included in the 
analysis. Our derivation shall assume that y > 1, so that the energy levels on the QD can be 
considered to be continuous. The thermodynamic averaging over the charging energy is 
accomplished by including the factor of exp(—BUn7). In this case, the system is a type of 
Hubbard model. 


8.6.3. Rate Equations 


The Coulomb blockade is derived using a set of rate equation, which describe how an 
electron hops on or off the QD. Define f, as the probability of there being n electrons on the 
QD. If the number is 7 in equilibrium, then we expect the probability to be something like 


f, = -expl -Bin — 7)A — Bn — APU] (8.379) 
Zop 


l= f, (8.380) 


where A is the energy to put a single electron on the QD, irrespective of the charging energy. 
It is convenient to change the notation and to define n’ = n — n as the number of excess 
electrons (n’ > 0) or holes (n’ < 0) on the QD. Similarly, the label on the site probability is 
changed to fy. The subscript denotes the deviations from neutrality. 

Define R;(m, n) as the probability that the number of electrons on the QD changes from 
n to m by a jump to the electrode to the left. Similarly, Rp(m, n) is the probability of going 
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from n to m by a jump to the electrode on the right. Since only single jumps are considered, 
then m = n + 1. These quantities are given earlier as Ip, zg in Eq. (8.347) and (8.348). 


_ SE y(n) 
R;(n+ 1, n) = BEO — ] (8.381) 
— 6r£(n — 1) 
R,(a—1,n)= 7 o EGT (8.382) 
_ CrEp(n) 
Rp(n+1,n) = SBE) — ] (8.383) 
_ CrER(n — 1) 
Rg(n — 1,n) = 1 o Bim (8.384) 
SRL = 4n\T, rl NirNop (8.385) 
E,(n) = eVop + Un + 1) (8.386) 
ER(n) = eVop — V) + U(2n + 1) (8.387) 


The left electrode is taken to have zero voltage, while the right one has a voltage of V The 
quantum dot has a voltage of Vgp. The voltage differences Vọp and Von —V are the 
differences in the chemical potential between the left electrode and the QD, and between the 
QD and the right electrode. The factor of (2n + 1) = (n + 1)? — r? is the difference between 
the charging energies before and after a hop that adds an electron to the QD. The factors of 
NL, Nr, Nop are the density of states at the chemical potential for the left and right electrodes, 
and for the QD. 

As an example, consider the subsystem of just the QD and the left electrode. The rate 
equation for the process of having electrons hop between the QD and the electrode is 


Es = —fhlRi(n + 1,n)+ Rin — 1, n] +f Rn, nt 1) +f, R(n,n— 1) (8.388) 


The first term on the right of the equals sign is from events where the system starts in state n 
and changes by having an electron hop either on to or off of the QD. The remaining two terms 
are events where the QD starts with either n + 1 electrons and gets to n by a hop. In equi- 
librium, the average rate of change is zero. Set the right-hand side of the above equation to 
zero. This step can be accomplished with the identities 


fF, Rin + l, n) = fayı Rin, n + 1) (8.389) 

fı Rin, n — 1) =f, R (vn — l.n) (8.390) 

Adding these two equation gives the right-hand side of (8.388). However, the above two 
equations are actually identical. The second is the same as the first by changing n > n — 1 


everywhere. Only consider the solution to the first equation, which can be manipulated to 
write it as 


R l, 
fess c RED = exp[-BE, (i) (8.391) 


Sn = Jo exp[—eVopn — BUn?] (8.392) 


The second equation above is the solution deduced from the first. The result is expected for 
thermal equilibrium. 
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The case for the QD tunneling is different. There is a voltage V across the device, and a 
current flowing, so that the system is not in thermal equilibrium. The relationship among the 
factors of f, must be derived anew. The rate equation for this case, including both electrodes, 
1S 


d 
Tn AIR +1,1) +R; (n — In) + Rel + 1n) 
+ Rg(n ~ l, n)| + fro lRON, n+ 1) + Ran, n+ 1)] 
+f,[R,@™,n — 1) + Re(n, n — 1] (8.393) 
In steady state, the average derivative is zero, and the right-hand side vanishes. As in the 
above case, this constraint is accomplished by 


FAR + 1,n) + Ran + 1, n) = fry [Ri n + 1) + Ren + 1) 
Jasi Ri(n+l,n)+ Rp(n+ 1,n) (8.394) 
fn = Ri(n,n+1)Rg(n,n+1) 


The above equation determines the occupation numbers f, for the quantum dot. Another 
important equation is for the current. One can count either the electrons leaving the first 
electrode, or else those arriving at the second one. These two expressions for the current are 


I, =e} frlRi(n + 1,n)— R(n- 1, n)] 


=e) [f,R,n+1,n)—fi4 Rn, n+ 1) (8.395) 
Ip = eX f[Re(n — 1, n) — Ren +1, n) 


=e) [fiir Re(n,n+1)—f,Re(n + 1, n)] (8.396) 


In the second equal sign we changed n —> n + 1 in one of the terms, which is permitted since 
there is a summation over n. In steady state the two currents are identical: J; = Ip. This 
identity is automatically satisfied by Eq. (8.394). 

As a simple example, consider the case that the charging energy U = 0. In that case the 
solution to the rate equations is that 


ELEL = —SrEp (8.397) 

EreVop = —Ere(Von — V) (8.398) 
_ R 

Vop = Ve +E, (8.399) 


Putting this result into Eq. (8.394) gives that the ratio f,,,/f, = 1. All of the occupation 
numbers f, are the same. An evaluation of the current gives that 


I= ey eR (8.400) 

ČL + ËR 
This result is the same as that obtained by two resistors in series. A simple tunnel junction acts 
like a resistor. A series of two junction behaves as two resistors in series. The net resistance is 
the sum of the two resistances. In the present problem, the resistance for each connection are 
rer =1/ (et LR). The above formula shows that the net resistance of the two-step tunneling 
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is the same as adding the two resistances. This result is expected once the energy levels in the 
QD are treated as continuous, and the charging energy U is neglected. 

The Coulomb blockade is obtained by including the charging energy. The example is 
simplified by assuming that the two electrodes are identical, so &; = Ep. Furthermore, it is 
assumed that the QD is exactly midway between the two electrodes. Then one must choose 
Vop = V/2 by symmetry. For each value of V the ratio (8.394) is solved for the relevent 
number of n-values. Define this ratio as r(2n + 1). By normalizing all occupation numbers to 
Jo, we find for the partition function and the current 


Zop =f: +r(DUL +r +- 


1 1 
Ha f tlt )| (8.401) 
T = 2 (8, (1, 0) — Ry(-1, 0) + r(DIR,@2, 1) — RL. D] 
OD 
1 
top EO —1)— R,(-2, =D] + ++- | (8.402) 


A typical result from this formula is shown in Fig. 8.15. The current is an asymmetric 
function of the voltage. The onset of current begins at e|V| = 2U since the current can only 
occur if e|V|/2 > U. Electrons can only hop from either electrode on to the QD if the voltage 
difference supplies the charging energy U. Since the QD is in the middle of the space between 
the electrodes, a voltage of V gives only V/2 between the QD and either electrode. Note that 
there is a second weak threshold when e|V|/2 =3U at e|V| =0.6eV. This point is the 
threshold for having two extra electrons on the QD: the factor of three is 2? — 1°. 

Since the geometry is perfectly symmetric, the current can also proceed by the 
generation of holes (n < 0). In this case, the first step is for an electron to hop from the QD to 


0.5 


-1 -0.5 


0 0.5 
V(volts) 


FIGURE 8.15 Current—voltage characteristic for a quantum dot placed symmetrically between two electrodes. 
U =0.1leV and T = 10K. 
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the left electrode, making a hole on the QD. The second step is for an electron to tunnel from 
the right electrode to the QD, which fills this hole state. 

This example is typical for the case that the QD is large enough that its electron states 
can be considered to be continuous. A much different case, for a small QD, is when the 
energy levels are large compared to the thermal energy. Then the conductance is a series of 
steps, where each step is another available energy state on the QD. 

An interesting experiment has a magnetic impurity in the QD (Goldhaber-Gordon et al., 
1998). The tunneling becomes a type of spectroscopy which can measure the density of states 
of the magnetic impurity. The experiment allows a test of solutions to the Anderson model. 


8.6.4. Quantum Conductance 


Ballistic transport is the case where an electron carries current without any scattering 
(Landauer, 1981, 1989). There are several relevent length scales: the length L of the sample, 
and the mean-free-path À of the electron. Earlier in this chapter it was shown there are 
separate values of the mfp for: momentum scattering, energy scattering, electron—electron 
interactions, etc. Here the most relevant mfp is that for momentum scattering (A,), since it 
enters into the definition of the electrical conductivity. If L < À, then the current is carried 
across the material without much chance of scattering. In this case the transport is called 
ballistic. 

A special formula is needed for the current from ballistic transport. One cannot employ 
the Boltzmann equation or the Kubo formula, since they all assume a system with quasi- 
equilibrium, which means that the mfp is smaller than the dimensions of the sample. Instead, 
the correct picture is from simple quantum mechanics. Consider a particle moving to the right 
in one dimension, with a wave vector k. The number of particles per unit distance with wave 
vector k is 


dk 
5 te (Ex) (8.403) 


The Fermi—Dirac occupation number is for the source of electron on the left electrode. 
Assume there is another electrode on the right which is at a voltage V with respect to the one 
on the left. Then the number of electrons going from right to left is given by 


k 
Z nr +eV) (8.404) 


The net current is obtained by subtracting these two formula, and multiplying by ev, where v; 
is the velocity 


1 = É [doD — meu + eV) (8.405) 

= 5 | Abul Ee) -aré + eV) (8.406) 

1 =0,V (8.407) 
e? 

Oy = — (8.408) 


The quantity e?/h is the quantized conductance. Its units are interesting. In cgs units, e*/hc is 
dimensionless, so that e*/h has the units of velocity. That is not the correct result here. 
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Instead, in SI units e has the units of Coulombs, and Planck’s constant has the units of Joule- 
second, which gives that 6, has the units of Siemens, which is the inverse of the Ohm. The 
approximate value is 6) = 38.8 LS. 

The above derivation applies to a single channel of electrons. Consider the conduction 
down a quantum wire. Treat the wire as a type of wave guide, which has many conducting 
modes, although each new one occurs at higher energy. The number of conducting modes as a 
function of energy is defined as N.(£). This quantity is a series of step functions as each new 
mode opens up along the quantum wire. Then the current is actually given by the expression 


I =20,NA(eV)V (8.409) 


The factor of two in front is for spin degeneracy. Each orbital mode usually has spin 
degeneracy. This symmetry is broken in a magnetic field. Some writers prefer to define 269 as 
the quantum of conductance. 


PROBLEMS 


1. Use the piezoelectric electron-phonon interaction in Sec. 1.3 to calculate the temperature 
dependence of the electrical conductivity in a semiconductor. 


2. Consider the self-energy of an electron from unscreened exchange, 
l , 
2p) = 7-2 v 2 FO + a, ip) (8.410) 
BVT D 


which was evaluated in Sec. 5.1.6. Derive the Ward identity for this self-energy—the equivalent of 
(8.81). Use this result to show that the Coulomb ladder diagrams have negligible effect upon the basic 
polarization P)(q, iœ) in the limit where q — 0. 


3. The correlation function 


Boo 
x(q, io) = -| dre™iT,pla 1)p(—q, 0)) (8.411) 


vanishes when q — 0. Show that the bare bubble P“)(q, iw) has this feature. Use the Ward identity to 
show that it still vanishes when self-energy functions and vertex functions are included in the evaluation. 


4. Derive the rate of temperature relaxation in a semiconductor due to the deformation potential 
scattering of electrons by acoustical phonons. Give the answer for high and low temperatures. Evaluate 
the numerical value of the lifetimes using the data in Table 8.1. 


5. Use polar coupling to optical phonons to calculate the rates of relaxation for an electron in a 
semiconductor: (a) for momentum, (b) for energy, and (c) temperature. 


6. Use deformation coupling to optical phonons to calculate the rates of relaxation for an electron in a 
semiconductor: (a) for momentum, (b) for energy, and (c) for temperature. 


7. Use (8.228) to evaluate y(e + ið, e + ið), and show that it equals unity, in agreement with the Ward 
identity. 


8. Derive the Ward identity (8.219) for the phonon ladder diagrams. 
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9. Solve the scalar vertex equation (8.219) using the same techniques used to solve 
y(p, € + ið, € + ið). Show that T (e + ið, € + ið) = 1 — 0%/de but that P(e + ið, e — ið) does not obey 
the Ward identity. Furthermore, show for every solution R(¢) = I'(e + ið, e — id) to its vertex equation 
that R(eé) + a Im[X(e)] is also a solution, where a is an arbitrary constant. 


10. Write down the collision rate (df/dt) for electron—electron interactions. Evaluate it for a 
semiconductor where the few conduction electrons obey Maxwell—Boltzmann statistics. Use Debye 
screening for the dielectric function. 


11. Show that the leading term in the thermopower for a free-electron gas is 


nek? T 
S=——_3 8.412 
2eEp ( ) 


12. Evaluate the correlation function LÊ? for the thermal conductivity of a metal. Use the free-electron 
heat current operator, and show that the integral diverges for the first term in the vertex summation. 


13. Solve Eqns. (8.215) and (8.235) in the limit of small temperature and show that the resistivity 
p ~ T° from phonons. Assume that «2.F(u) = gu% at small u, where g is a constant. 


14. Compare the relaxation time for transport deduced from the force-balance theory (8.198) with that 
given by (8.236). How do they differ? Show they become identical in the limit of high T. 


15. Write down the rate equations for current between two electrodes separated by two identical 
quantum dots in series. What is the minimum voltage for current to flow at zero temperature? 


Chapter 9 
Optical Properties of Solids 


9.1. NEARLY FREE-ELECTRON SYSTEMS 


9.1.1. General Properties 


This section is concerned with the optical properties of free-particle systems, such as elec- 
trons in metals or semiconductors. A completely free-electron system does not absorb light at 
all, so that its optical properties are uninteresting. The ability of the nearly free-particle system 
to absorb light is due to its imperfections or deviations from homogeneity. If these effects are 
small, then so is the light absorption. This situation is described here. The optical properties 
are calculated for simple metals and semiconductors. The best way to do this is using the so 
called force—force correlation function. 

First it is useful to recall some of the fundamental formulas regarding the optical 
properties of solids. As shown in Sec. 4.5.3, the transverse dielectric function of the solid in 
the limit of long wavelength is 


2 


e (@) =1 -2% _ Tro) (9.1) 
1, 
nlio) = —— | dre T, JO (9.2) 
0 


where 1(@) is the retarded form of the current—current correlation function n(iœ). The optical 
wave vector k is so short that the correlation function is evaluated in the limit where k —> 0. 
The results for k — 0 are sometimes different from those for k = 0, in which case the limit 
result is appropriate. The k symbol is omitted in the notation. The system is assumed to be 
isotropic. The index p is any direction (x, y, z), since the average value of (/,,7,) is equal to 
that of (j - j)/3. The conductivity o(@) of the solid is given by 


e i(@) = 1+ = (9.3) 
ine in(@) 


580 Chap. 9 e Optical Properties of Solids 


with the usual identity Re(o) = —Im[n(@/o]. The relationships are also valid for the real part 
of =(@) which contributes to the imaginary part of the conductivity. These general formulas 
will be the basis for most of our discussion. 

A general theorem which can be proved is the f-sum rule, 


ei (Œœ) = €,(@) + is (œ) (9.5) 
oe Ta 
l odos (œ) = > (9.6) 
0 


The similar theorem for the longitudinal dielectric function was shown in Sec. 5.7. The 
theorem for the transverse dielectric function can also be written in terms of the conductivity 
since £, = 4r Re(o/@): 


dw Re[o(@)] = 
0 2m 


OO 

| Tye? 0.7) 
The real part of the conductivity is the absorbing power of the solid. This theorem states that 
the absorbing power depends only on the density of particles nọ and not on the detailed form 
of the interactions. Many-body theory can rearrange the absorbing power by moving it from 
one part of the spectrum to another, but it cannot alter the net amount. The statement in the 
opening paragraph, that the light absorption is small if the perturbations are small, needs 
clarification. Insight is provided by examining the Drude formula. 

The Drude formula is an empirical relationship which is found to be valid, over a 


restricted frequency range, in most nearly free-electron systems. It states that 


2 
Noe 1 
=- _ 9.8 
no) m 1 —iwty 0-8) 
Nye? To 


Re[o(@)] = (9.9) 


m(1 + œt) 
The quantity Tọ is a relaxation time, which is usually taken to be a constant. Assume that the 
Drude formula is valid for all frequencies and with a constant relaxation time. Then the sum 
rule is satisfied independently of the size of the relaxation time: 


cO 2 OO 2 
l do Re{o()] = Nge To l do TMe 
0 


— = 9.10 
o L+(@ty)y 2m em) 


m 


Re[o(œ)] is a Lorentzian distribution, whose width is 1/tọ and whose height is Tọ, so that the 
area under the curve is independent of To. 

The discussion of dc conductivity showed that inverse relaxation times are proportional 
to scattering rates. The purely free-particle system, without any inhomogeneity, corresponds 
to the case where 1/tTọ = 0 or Re[o(@)]| « 5(@). The scattering processes contribute to To. If 
the Drude expression were rigorous, one would only need to find Tọ, which in fact is finding 
1/T). The limit œ — 0 in (9.9) gives the de conductivity Re(o) = nge?ty/m. 

If the scattering rate 1/t» is small, then Tọ is large, and the dimensionless product wtp is 
large except at very small frequencies. Expand (9.9) for mt, >> 1 and find 


2 
4nRe[o(o)) = —2-| 1 +0{1. (9.11) 
O T (Wt) 
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At higher frequencies the real part of the conductivity is proportional to the scattering rate 
1/T ), which is small by our assumption. This result is what is meant by the limit of weak 
scattering. The method of force—force correlation function casts Re[o(@)| in this form, as an 
expansion in the scattering rate. The resulting theories are valid at high frequencies where 
OT, > 1. 


9.1.2. Force—Force Correlation Functions 


The first theorem to be proved has already been mentioned, namely that the homo- 
geneous electron gas does not absorb any light. This assertion is a by showing that the 


Hamiltonian commutes with the momentum operator P = $` KC oCko 
Hy = X &pChoCpo +> > De 1Chg,0’Ch-q0Cpo Cko (9.12) 
po Vp kq o0’ 
d . 
0 = — P = i[Ho, P] (9.13) 


dt 


The momentum operator has no time dependence and is a constant of motion. 
The absorption of light is described by the Kubo formula, which is the correlation of the 
momentum operator with itself: 


Re[o(o)] = e (= 


émv\ o )[ aero * P(0)) (9.14) 


where the current is proportional to the momentum j(t) = (e/m)P(t). The lack of time 
variation in P means that the real part of the conductivity is identically zero for any nonzero 
frequency. There is no light absorption for œ Æ 0. 

A simple way of deriving the force—force correlation function was given by Hopfield 
(1965). The argument proceeds in two steps. The first step is to replace P(/) by the identical 
operator (i/@)dP/dt, which is justified on the grounds that in an ac field of frequency œ the 
time dependence of interest is just exp(—imt). This replacement is done for both P operators, 
which gives the expression 


á -Bo rl dPO [RA 
Re[o(o)] = TETEN (1 — eP | ae a Ze] ) (9.15) 


The second step in the argument is to notice that the time derivative of the momentum is just 
the force F(t) = dP/dt, so the force—force correlation function is 


Re[o(o)] = (1 — cyl dte™ (F(t) - F(0)) (9.16) 


6m2o2v Loo 
This derivation is essentially correct and will be justified later with more rigor. The force- 
force correlation function then serves as another possible starting point for the evaluation of 
the conductivity. For free-particle systems, it usually leads to a usable answer with much less 
work than the current—current correlation function, because the first term in the perturbation 
expansion of the force—force correlation function is usually the answer. For example, if the 
force is due to phonons, then F x M,(q), where M,(q) is the electron-phonon matrix 
element. The leading term in the correlation function is proportional to M?, which is often the 
largest term. The remaining parts of the correlation function are evaluated assuming the 
particles are free, which is quite easy. Similarly, if the absorption is due to impurities, the 
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leading term is proportional to concentration n; of impurities, which is often the important 
term. The advantage of the force—force correlation function is that the leading term is often 
adequate. 

One calculation for which the force—force correlation function would be a poor starting 
point would be the dc conductivity. The correlation function has a prefactor of m~*, and 
obviously a great deal of cancellation in the correlation function has to take place in the limit 
œ — 0 in order to get © = nge?ty/m. 

A rigorous derivation of the force—force correlation function was given independently by 
Mahan (1970) and Hasegawa and Watabe (1969). The trick is to start from the Kubo formula 
and integrate by parts on the t variable: 


1(e 
(io) = — > | dre™ (T, j(t)j,(0)) (9.17) 
| uav = uv — | va (9.18) 
l eiot 
u = j,(t), v= = (9.19) 


The integration by parts gives 


dj,,(t) 
dt 


l 1 P on 
io) = —— [( j | (0) —j — me T. (0 9.20 
nlio) = — [Ju Bu) — JOON + =| are ( eit ) (9.20) 
The first term is the constant of integration. These constants have the general form of 
commutators: 


(A(B)B(0) — A(0)B(0)) = Tr(de°” B — e~?" AB) (9.21) 
= Trle~ "(BA — AB)] = —([A, B]) (9.22) 


where the cyclic properties of the trace are used to rearrange the first term. The one in (9.20) 
is Zero since it is the commutator of j,,(0) with itself. 

The next step in the derivation is to take the integration by parts on the other current 
operator j (0). Since the correlation function depends only on the difference of the two qt 
values in its argument, this correlation function is also equal to (Tr[(dj,(t’) /(4t') yo jy (—?)). 
The integration by parts on the term j,(—t) gives 


= -([ii0,[42]_J) 
no) = (iO ae 


1 6 of [au EA 
“workin ba la |p 8 


The first term contains the commutator of j, with dj, /dt. This commutator yields a constant, 
which is real and so does not contribute to the real part of the conductivity or to the 
absorption of light. But Stmanek (1971) has shown that this term does contribute to the 
renormalization of the optical mass so it is retained. The other term in (9.23) is the frequency— 
dependent part, which is indeed the correlation function of dj/dt with itself. The current is 
just proportional to the momentum j = (e/m)P, so (9.15) is derived again. 

The Hamiltonian is written as H = Hy + V, where Hy is the homogeneous electron gas 
in (9.12), while V is the potential which causes other forces besides electron—electron inter- 
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actions. For example, the potential could be the sum of the interactions with the crystalline 
potential Vg where the G are reciprocal lattice vectors, with impurities of density operator 
p,(q) (see Sec. 4.1.5), or with phonons: 


V = paa) (9.24) 
q 
_ ra M, (q) 
D(a) = g= VG + p:(q) + LT N (dq, + a'a) (9.25) 
The time derivative of the current operator is 
J; 
E = [j = ajal =- LOM, PO) (9.26) 
Li, P(A] = — => kl CE Css Chg Cp] (9.27) 
kpss’ 
=47 Chpqs Cral + 9) — kd = = 4,01) (9.28) 
i 
at = -ÉI q,p(a) (a) (9.29) 
q 


The last commutator is just Newton’s law: the time derivative of the momentum is the force, 
which is the gradient of the potential 


FŒ) = -V, V(r) = i E V (gae (9.30) 
q 
The evaluation of the constant term in (9.23) requires the additional commutator 
| “| = -ÉY pO, PA] (9.31) 
e dt mg" H? l 
an 
-7z 2 IPP) (9.32) 
q 


Using these results in (9.23) gives 


. e / p iO, T 
nhio) = mlo” - > aiD) + 3 TAN dte" 


q 


x (T,p(q, t)(q, t)p(q’, ODd, o| (9.33) 


The generalized potential ®(q) is given a t dependence, which applies only to the phonon part 
of the three terms in (9.25). The second term is the force—force correlation function, which is 
equivalent to (9.16). It is often sufficient to evaluate this expression only to order ®°. The 
second term is already evaluated to this order, so that the remaining correlation function is 
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evaluated ignoring the interaction V In this case q’ = —q. The first term in (9.33) must also be 
evaluated to order ©” by expanding the S matrix for the interaction potential V: 


(p(q)®(q)) = Tr[e~°”,S(B) p(q)®(q)] 
B 
= Tr[e™ p(q)(q)] — | dtTV pla, 0)®(q, 0)) + OT) (9.34) 


Only the term in Vis retained in order to have a result proportional to ®*. The first term on the 
right 9(p(q)®(q)), is zero. Combining both terms in (9.33) gives 


e? 


nlio) = -z qo) ip AN dr(e®™" — 1)(T,p(q, t)p(—4, 0)) (T,B(q, t)®(—q, 0)) lo 35) 


The term with exp(i@t) comes from the force—force correlation function, while the —1 term 
comes from the constant term. 

This formula will be evaluated for several different types of potentials in the remaining 
parts of this section. The type of formula which results can be illustrated by a simple example. 
Take the case where the potential ® is independent of t. One example is the scattering from 
impurities, where, according to Sec. 4.1.5, 


ODDA) = 5 (HQ) VCa’ NPP) (9.36) 


Mi 
= ~ VD Sgro (9.37) 


where V;(q) is the unscreened potential between the electron and the impurity. The remaining 
part of the t integral in (9.35) is the definition of the longitudinal dielectric function from Sec. 
5.4: 


i iO, T q? l 
tf dte™" (T.p(q, t)p(—q, 0)) = os Lia 1 (9.38) 
l l 
e(q, io) ela) 


ri(io) = ———,, x 4h q l 


Ti i@) 


| (9.39) 


The subscript i denotes the impurity contribution to n(iœ). The current—current correlation 
function is expressed as the difference between 1/e(q, im) and 1/e(q, 0). Since e(q, im) is an 
even function of iœ, in the limit where iœ — 0 it must behave as e(q, iœ) = &(q, 0) + Olio”, 
which shows that n; does not diverge in this limit. The retarded function is found by 
iù —> œ + id: 


F1 l 
T;(@) = Tni h “9 V;(q) a o) l (9.40) 
, dq 1 
Re[o,(@)| =~ os i y qn yV, Vi(qy In| 5 | + OV?) (9.41) 


Equation (9.41) is the Hopfield (1965) formula, which shows that the real part of the 
transverse conductivity is proportional to the imaginary part of the inverse longitudinal 
dielectric function. The latter quantity is interpreted as the rate of making excitations in the 
electron gas. The light absorption occurs because electronic excitations are created, while the 
creation rate must depend on the concentration of impurities n; in the system. This inter- 
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pretation agrees with our original assertion that the electron gas can absorb light only when 
the impurities, or other inhomogeneities, are present to dissipate the momentum. An obvious 
advantage of the Hopfield formula is that the complicated aspects of electron—electron 
interactions are naturally included in the dielectric function, for which one can use the model 
forms described in Chapter 5. 


9.1.3. Frohlich Polarons 


The optical absorption of free polarons was first calculated by Gurevich et al. (1962), 
who started from the usual form of the Kubo formula. In the Frohlich Hamiltonian (7.1), 


H= 2 EpC tC c + O0 Lag AyAg p% > E Cha oCpolaa + ata) 


(9.42) 
Anaa?” 


M? = 
0 /2m 


the electron-phonon interaction V is expanded using the usual S-matrix techniques. The real 
part of the conductivity is calculated to order O(«), which is the first nonvanishing term. The 
three diagrams which enter proportional to « are shown in Fig. 9.1. The first two are self- 
energy diagrams, while the third is a vertex contribution. The final result is obtained by 
adding contributions from all three diagrams. At zero temperature it is 


penon xa? 


Re[o(o)] = - JE TTO — o) + O(a") (9.43) 


The free polaron can absorb light only when the frequency œ > @ọ. Polarons can emit LO 
phonons only when the polaron energy exceeds the phonon energy œọ. The light wave excites 
the polaron to this energy, and then the polaron can emit the phonon. The light absorption 
happens by the emission of LO phonons, where the electron serves as the intermediary in the 
process. Of course, there is also a direct coupling between the photon and the TO phonons, 
which was discussed in Sec. 4.6 on polaritons. 

Formula (9.43) can be derived from the correlation functions in (9.35). The part 
involving the potential is now just the Green’s function for LO phonons: 


M 
O(q) = NA + aa) (9.44) 


2 
(T,®(q, )®(—4, 0)) = 3.9%) (9.45) 


Or > 


FIGURE 9.1 
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Denote the density—density correlation function by the symbol x(q, t). The t integral in 
(9.35) is evaluated by changing it into a summation over Matsubara frequencies, which is 
evaluated by the usual contour integral: 


x(q, t) = (7. Pa t)p(—q, 0)) (9.46) 
. — Mge iT __ (0) 4 
(iw) = - ve 7 l dt(e 1)x(q, 1)2 (1) + OM) 
= _ Moe i GO GQ IAG iw + ig) — x(q, iq)] 
mlio v T E apy 
_ Me q4 [~ de 0) 
=- EDA] ORM ORA o +0) 
+ 2 Im[x(q, ©]2® (e — io) — 2 Im[D(e)y(q, €)]} (9.47) 


Take the retarded function ia —> œ + ið, and the real part of the conductivity is the imaginary 
part of the preceding expression, where the last term does not contribute: 


au | dq af 
mor J OnP E] 


x {Im[D (e)]Im[x(q, £ + ©)] — Im[x(q, &)Jim[DO (e — @)]} (9.49) 


Re[o()] = E nal) (9.48) 


This result is general and applies to all electron-phonon systems. Sometimes it is useful to 
change variables £ — £ + œ in the second term, which gives the equivalent expression 


2 42 3 2 poo 
Re o(@) = ee | al de — [ng(£) — ng(e + @)] (9.50) 
x Im[D (e)]Im[x(q, ¢ + @)] (9.51) 


The spectral function for the phonon Green’s function is easy to evaluate: 
—2 Im[D()] = 2n[5(e — ©) — (E + ©) (9.52) 


The last step in the formal derivation is to determine x(q, œ). The free polarons are assumed 
to have small concentration no, so that electron—electron interactions can be neglected. Then 
the density—density correlation function is adequately approximated by the simple bubble 
diagram: 


x(a, 10) = P(g, 10) = 5 E OID +9) (9.53) 
P.P 


3 
Poq o)=2| n 1 | 
(2n)° "PN pg toti Ep — Eptg — © — 18 


—2 Im[P“(q, w)] = 27 | 


(9.54) 


(2 m n, [Ò(Ep — Enig T ©) — ÔlEp — Ep+q — ©)] 


The factor of 2 in front is for the spin degeneracy. The correction terms to this simple bubble 
term, from the electron-phonon interaction, contributes to higher-order terms in «a. 
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The polarons are assumed to obey Maxwell—Boltzmann statistics, so the particle density 
n, per spin state is 


3/2 
Ny (2B 
= — | — — 9.55 
Mp = 5 ( m ) exp[— Pep] (9.55) 
The present calculation is for zero temperature. The particles all approach the state with 
p — 0, so from (9.54) the spectral function is 
—2 Im[x(q, @)] = 2nn [òl — £4) — Slo + £4)] (9.56) 


Now evaluate the real part of the conductivity given in (9.51). Both spectral functions are 
given by delta functions, for phonons in (9.52) and for electrons in (9.56). One set of delta 
functions are used to eliminate the de integral, which gives 


2Mee*ng | dq dh 
m?o? ie q? 

x [5(@p + @ — Eq) — Òl + @ + Eq)] 

+ [1 + ng(@o) + ngo — o)l — M — Eg) — Òl — Oo + Eq)]} (0.57) 


Re[o(@)] = x {[ng(@o) — ng(@p + ©)] 


At zero temperature, the boson occupation factor is ng(œ) = —@(—@). This expression 
simplifies for œ > 0, cos? @ = gi/q’ to 


2Mee’no f dq > 
_ Do 9.58 
Re[o(@)] = 7 “Jo = cos” Olo — Wp — Eq) (9.58) 
242 
— n Moe mo (=) 2m J0 =, @(o — 0) (9.59) 
m2o3(2n) \ 3 


Using the form for the electron-phonon matrix element M gives the result (9.43) of 
Gurevich et al. The derivation has been remarkably short. The general form of the answer in 
(9.51), as the product of two spectral functions, is quite familiar from other calculations, and 
in this case both spectral functions are delta functions. 

The optical absorption of Frohlich polarons has been calculated extensively by Devreese 
(1972). He calculated the terms to order O(a”), which are the two phonon terms. The polaron 
can be excited by the light and make two phonons as well as one. Results are shown in Fig. 
9.2 for the value of the coupling constant « = 1. The solid line is the result using (9.43) which 
includes just the one-phonon processes, while the dashed line is the result including both one- 
and two-phonon processes. The rather surprising result emerges that there is no sudden rise in 
the absorption at the two-phonon threshold œ = 2@,. The sharp rise at œ = Wp is not repeated 
as a sharp rise at 2@,. Instead, there is a very gradual rise starting at 2m, and a slight decrease 
in the peak height. The two-phonon events make only a small change in the result. The 
polaron would rather emit two phonons as two separate events of one-phonon emission and 
not as a correlated event where two phonons are emitted. This conclusion is in accord with the 
remarks in Sec. 7.1 about the correlation between two phonons being small in the polaron 
cloud. Devreese (1972) has also calculated the optical absorption of polarons by the Feynman 
method of path integrals as well as by strong coupling theory. At larger values of polaron 
constant « he finds sharp absorption lines, which are interpreted as arising from internal states 
of the localized electron in the small polaron. 
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FIGURE 9.2 Theoretical prediction for the optical absorption of free polarons with « = 1 and T = OK. The solid 
line is one-phonon theory, and the dashed line has two-phonon correlations. Source: Devreese (1972, p. 93). 


9.1.4. Interband Transitions 


Another application of the force—force correlation function is to calculate the rate of light 
absorption by interband transitions in a metal. Here the potential which provides the inho- 
mogeneity is the crystalline potential of the atoms. This potential 1s periodic in crystalline 
solids, so it contributes momentum in discrete units called reciprocal lattice vectors G. This 
contribution was mentioned earlier as the first term in (9.25): 


(q) = 3 VgSq=G (9.60) 


This potential is fixed and has no correlation with itself: (O®(q)®(q’)) = (®(q))(®(q’)). 
However, the other correlation function in (9.35) is (p(q, t)p(q’, 0)) and is nonzero only when 


q= -q: 


(®(q)®(—q)) = 3 Ve V_cða-c = 3 Ve8q-c6 (9.61) 


This potential is static, so the correlation function of the potential is a constant. This case is 
similar to that of impurity scattering, which was given in (9.41). The impurity result can be 
used for the crystalline potential by changing n; ACO to Ve8q-c- The interband absorption is 


Re[o/(w)] = — — 3 GGV in| | + OVÈ) (9.62) 


The subscript J stands for interband contribution. Note the following: 


\ & 
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where 1/|e|? is usually grouped with V& to have a screened electron-ion interaction 
\V</e(G, w)|*. If the RPA result in Sec. 5.5 is used for €,(G, œ), then when G > 2k, 


&(G, 0) = |e - (Z 7) (© - ca) 


Gi Ve | my? 
Go je 7 (3) (© -= 8c) i 


This formula is nearly identical to a standard formula for describing interband transitions in 
metals: the Wilson—Butcher formula. 

The Wilson—Butcher formula is used to describe the interband transition in the alkali 
metals; Figure 9.3(a) shows the band structure in an alkali metal in the k = (110) direction. 
The Fermi level is shown as the dashed horizontal line. The interband transitions are shown, 
where the transition appears vertical in wave vector space since the wave vector of light is too 
small to show up as a horizontal deflection of the arrow. This transition is shown in the 
reduced zone scheme. Of course, in the extended zone description of Fig. 9.3(b), the optical 
transition really changes the wave vector of the electron by a reciprocal lattice vector G. This 
change in the momentum of the electron, during the optical transition, is precisely what it 
needs to have the transition. The optical absorption occurs because the electron gains energy 
from the light wave and momentum from the rigid crystal lattice. 

The optical absorption one expects from an alkali metal is shown in Fig. 9.4. At low 
frequency there is a Drude contribution, which comes from phonons. This contribution is well 
described by the Drude formula (9.9), where Tọ is largely provided by the intrinsic scattering 
from phonons. The interband transition of Fig. 9.4 starts about A@ = 2 eV, with a linear rise at 
threshold. Since the Fermi surface does not touch the zone face, kr < G/2, and (9.64) is the 
appropriate form to use for €,(G,@). Figure 9.4 shows the experimental result of Smith 
(1969) for sodium, which has exactly this behavior. Similar interband terms were found by 
Smith (1970) for K, Rb, and Cs and in Li by Myers and Sixtensson (1976). The agreement 
between theory and experiment appears satisfactory for sodium. There is no reason to expect 
the Wilson—Butcher theory to give perfect agreement, since there are additional terms which 
are higher powers in O(V2). Similarly, most calculations of the Wilson—Butcher formula use 
the RPA form for e(G, œ), whereas, the Singwi—Sjélander form would be an improvement. 
There is also a slight uncertainly as to the precise value of the pseudopotential V,,). With 
these qualifications in mind, the agreement between theory and experiment in Fig. 9.4 is 
satisfactory. 

There is one difference between the original formula of Wilson—Butcher and the result 
(9.64) derived from the force—force correlation function. The original formula had the 
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FIGURE 9.3 Interband transitions in alkali metals as shown in (a) reduced zone and (b) extended zone 
representations. 
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FIGURE 9.4 The optical conductivity for metallic sodium. The points are the experiments of Smith, while the lines 
are the theoretical contribution from Drude and interband transitions. Source: Smith (1969). 


electron ion potential divided by the static dielectric function e(G, 0). The present derivation 
by the force—force correlation function is exact to order V2, so its factor of e(G, œ) is correct, 
and the old formula is not. The difference is more philosophical than numerical for sodium, 
since the frequency dependence of e(G, œ) is slight over the range of frequencies shown in 
Fig. 9.4. Actually these frequencies, are not much lower than the plasma frequency at 
Aw, ~% 5eV, but this fact is irrelevant, since at these large values of wave vector G the 
dielectric function becomes frequency dependent only at much larger values of o. 

The alkali metals are good examples of where the Wilson—Butcher formula works well. 
Here the lowest interband transition is in the visible region of the spectrum and 1s isolated in 
frequency from any other feature except the Drude contribution, which is always present. 
Ashcroft and Sturm (1971) have shown that another formula is needed for polyvalent metals 
such as aluminum, since there are transitions between parallel bands which cause sharp 
structure at lower photon frequencies. 


9.1.5. Phonons 


Equation (9.25) lists three contributions to ®(q). The first two terms are Bragg scattering 
and impurity scattering, which have been discussed. The third term is treated here. It is due to 
the electron scattering by the phonons. 

Equation (9.35) is evaluated, with ®(q) being the electron-phonon interaction. Only 
terms are retained that are of O(M?). This point is rather tricky, since a subset of higher-order 
contributions are also retained. It is assumed that the phonon modes are the actual modes in 
the solid. The calculation of these modes includes electron—electron and electron-phonon 
interactions, which means they include M,. Any time the actual phonon modes are used, then 
one is including a subset of higher-order diagrams. 
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In Eq. (9.35) the phonon part is 
l 
-(T,®(q, (4, 0) = SEMA, 1) (9.65) 


Equation (9.35) is now evaluated. Only keep the term that depends upon frequency iœ: 


qM. 
t(i@) = or TS a w| 


vm? (io) qa 4 


] 
e(q, 10) — iq) | 


The summation over Matsubara frequencies is evaluated using the techniques of Sec. 3.5. The 
phonon Green’s function is expressed in the Lehmann representation using its spectral 
function B(q, œ). The inverse dielectric function is expressed similarly, with its spectral 


function being Im(1/e) 
Em 1] ng(o’) — ng(€’) 
w’) < 
ala, gc')| io +o -e 


rio) = -aa Dah Ma |< 


Anvm?(iw) g 


(9.66) 


The retarded function is obtained by taking iœ — œ + ið. Then take the imaginary part. 
These steps cause the denominator (iœ +’ —&) to be replaced by the numerator 
nòl + æ — e’). The delta function is used to eliminate the integral over de’. 

This expression is usually evaluated at relatively low frequencies. When œ is much less 
than the plasma frequency of the metal, use the low-frequency limit for e (q, œ) = 
2mm? e*q-?@(2k,- — q). The various terms involving the phonon wave vector, matrix element, 
and spectral function are just the definition of the transport form of «2 F(u). The remaining 
factors combine to give 


Re[o(@)| = — mimo) (9.67) 
= Eo f dua POOO = inal) — nau — 0) 
+ (@ + u)[ng(u) — nglu + @))]} (9.68) 


At zero temperature, the boson functions ng arrange to limit the range of integration to 
0 < u < œ. This limit gives the simple formula first derived by Allen (1971): 


Re{o()} = one No 


ef dux? F(u% — u) (9.69) 


The Allen formula is approximate, since it is based upon the force—force correlation function. 
A more accurate expression would be based upon solving the Kubo relation for the 
conductivity as in Sec. 8.4. This derivation gives the result 


en [© 
Re[o(o)] = — To de[n-(€ + œ) — n-(E)] (9.70) 
x Re[y(e — ið, € + œ + id)/(E, £ + w)] (9.71) 
I(e, £) = oo (9.72) 


Qe) — Qe) + P(e) + Te] 
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FIGURE 9.5 Phonon-induced absorption in calcium as a function of frequency. The vertical axis is 
Re[o(@)mw/e*no]. The horizontal axis is ©/@®p, where the Debye frequency is œp. The solid line is the solution 
to the ac transport equation, while the dashed line is using the Allen formula from the force—force correlation 
function. The latter method is approximate. Source: Wu and Mahan (1984). 


This result can be derived directly from (8.204) and (8.213). The vertex function 
y(e —id,€ +œ + ið) obeys the integral equation (8.228). A comparison of these two 
expressions is given in Fig. 9.5 for metallic calcium with E||c. What is plotted is Re(o/6 ) vs. 
(0 /@p, where Gy = nge? /mo. The solid line is the exact result from (9.70) while the dashed 
line is the Allen formula in (9.69). The Allen formula has an error of about 10-15%. This 
makes it a good approximation, since (9.69) is very much easier to evaluate than (9.70). The 
latter requires solving numerically an integral equation for the vertex function. 


9.2. WANNIER EXCITONS 


9.2.1. The Model 


Exciton states play an extremely important role in the understanding of interband 
transitions in semiconductors. The word exciton is used here to signify the modification of the 
absorption rate of photons due to the Coulomb interaction between the electron and the 
valence band hole. The easiest case to understand is when the lowest interband transition 1s 
direct: which means that the conduction band minimum and valence band maximum are at the 
same point in k space. This configuration is shown schematically in Fig. 9.6(a). The valence 
band states are all filled, and the conduction band states are all empty. The vertical arrow 
shows a possible interband transition which can occur when a photon is absorbed in the solid. 
The arrow starts in one of the valence states, since these are occupied by electrons, and goes 
to an unoccupied state in the conduction band. The valence state is shown as nondegenerate 
(except for spin) at k = 0, although that is seldom the case; usually the band has an orbital 
degeneracy and is anisotropic. 

The point of view in Fig. 9.6(a) is a single-particle picture of the transition process. 
According to this picture, the transition rate for the absorption of photons is given by the 
golden rule: 


A(o) = = © (c, K'IÊ + plv, k) òle,(k) + Ao — 6 (K’)] (9.73) 
kk’ 
k? k? 

e,(k) = — om é.(k) = E, + om, (9.74) 
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FIGURE 9.6 Optical transition in a semiconductor between the occupied valence band and empty conduction band 
for a direct transition. (a) Conventional band picture; (b) Wannier picture where the photon makes an electron-hole. 


The energy zero is chosen to be the top of the valence band, so the bottom of the conduction 
band starts at the energy gap E,. The matrix element (c, k’|é + p|v, k) is between the one- 
electron initial and final states. The wave functions are taken to be Bloch functions; which are 
the product of a cell-periodic part u,,(1r) and an envelope function exp(ik-r). It is a 
reasonable approximation, for our simple level of discussion, to take the cell-periodic parts as 
independent of wave vector u,,(r) = u(r), u(r) = u,(r). It is also assumed that the valence 
band has p symmetry and the conduction band has s symmetry, which is true in many 
semiconductors but not all. Then u,(r) is a periodic orbital with angular momentum / = 1, 
while u,(r) is a periodic orbital with / = 0. With these approximations the optical matrix 
element is a constant except for wave vector conservation: 


eX" 
lv, k) = u,(r) K (9.75) 

, gik’ r 
lc, kK) = ult) (9.76) 
(c, K'IÊ + plv, k) = (clê + plo) Sy = ê * Pedra’ (9.77) 

a I a 
(clê + plv) = =] d’ru*ê + pu, (9.78) 
Vo Jcell 


where vọ is the volume of a unit cell. The matrix element (c|p|v) should also contain the 
factor exp(iq ° r), where q is the wave vector of the photon. This wave vector is small enough 
to be neglected for photons in the optical frequencies. Now that the matrix element is eval- 
uated, it is possible to do the integrals over the electron wave vector: 


2T a d’k k? 
A(q@) = — l€ ° Pol Jem 5( ha — Eg — E) 


- 2 3/2 
=|é-p,, a ho — E,O(ho — E,) (9.79) 
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Equation (9.79) predicts that the absorption rate begins at the energy gap of the semi- 
conductor and rises as the square root of the factor (hœ — E£),. It is a very definite prediction, 
which is not observed. In fact, the one-particle theory is totally inadequate and does not come 
close to describing the absorption spectra A(w) observed in experiments. 

Wannier (1937) observed that the interband transition in semiconductors was really a 
two-particle process, which is indicated in Fig. 9.6(b). Since the valence band states are all 
occupied, removing an electron from this band creates an excitation called a hole. This hole, 
which is the absence of an electron in an otherwise filled band, can be treated as a particle 
with an effective mass m, and a positive charge. The energy of the hole is ¢,(k) = k*/2m,. In 
the two-particle picture, the photon of energy creates two excitations in the semiconductor: 
the electron in the conduction band of wave vector k and energy E, + k* /2m, and the hole in 
the valence band of wave vector —k and energy k*/2m,. Energy conservation is 

k? k? k? 


=E,+— 9.81 
2 om, aor 6-8) 


ho = E 
W st > 


which is exactly the same as the argument of the delta function for energy conservation in 
(9.79). So far the two-particle picture leads to the same absorption rate as the one-particle 
prediction in (9.79). Both the matrix and the energy conservation are identical, so the 
prediction is exactly the same. 

The point made by Wannier is that there is now additional physics which can occur in the 
two-particle picture. The electron and hole are particles with charges of opposite signs, so that 
there is a Coulomb attraction —e?/(e,r) between them, where £ọ is the static dielectric 
function. This dielectric function is assumed to be a constant which is independent of 
frequency. This approximation is poor, since most semiconductors are polar, and the dielectric 
function has significant dispersion at frequencies near the optical phonon frequencies; see 
Sec. 6.3.1. The frequency-dependent screening is an interesting problem which is ignored by 
treating £ as a constant. The attractive Coulomb interaction between the electron and hole 
can cause hydrogenic bound states between them. 

The optical absorption rate for this process was calculated by Elliott (1957). The final 
state of the system is described by a two-particle Schrödinger equation: 


P(r, rp) = Uc(Ke My (K,) (Ke, Vr) (9.82) 
RV KPV e? 
= | —-— - —— - ——__- E |ð(r,, 9.83 
0 2m, 2m, Eq |i. — r,| l (re rn) ( ) 


O(r,,r,) can be factored into relative r=r,—r, and center of mass coordinates 
M = m, + m, in the standard fashion: 


MYL, + MV) 


R= (9.84) 
e eR 
Dire, ra) = Wi o(r) (9.85) 
ho e 
(-Ev -T s Jo) =0 (9.86) 
pP? 
E = E, + £, +5 (9.87) 
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The center of mass motion is plane-wave-like, with a wave vector P which in optical 
experiments is equal to the photon wave vector. This wave vector is small. It is neglected in 
the center of mass motion and set P = 0. The relative motion of the electron and hole is 
usually more interesting. For relative energy £, less than zero, the two particles form bound 
hydrogenic states with energies £, = €, = —Ep/n’. For relative energy £, greater than zero 
they form scattering states o,(r). Elliott showed that the optical transition rate depends on the 
relative wave function at r = 0, ,(0). Instead of (9.79), the transition rate is determined by 


Ao) = |e + pal? X14)? ~ E- 6) (9.88) 


The summation j is over the bound and continuum states of the relative motion of the 
electron-hole pair. The dependence on the relative wave function, evaluated at r = 0, can be 
understood by a physical argument. First, consider the corresponding emission process, 
whereby the electron and hole recombine to emit a photon with energy Aœ. The emission rate 
should depend on the relative wave function at r = 0, since that is the probability that the 
electron and hole find themselves at the same spot in the solid, which is necessary for the 
recombination. The dependence of the emission rate on 1 ,(0)|? is reasonable from physical 
intuition. But the matrix elements for absorption and emission are identical, so that absorption 
should have the same dependence. The absorption depends on the probability of making the 
electron and hole at the same point in the solid. 
The former result, for the noninteracting solid, is recovered from (9.88) by setting 
1 ,(0)|? = 1/v and £; = k* /2u. The results, for the interacting case are quite different. The 
relative motions of the electron and hole are in s-wave hydrogenic states, either bound or 
unbound, because of the angular momentum selection rule. The one-unit change in /, in the 
photon absorption, is taken by the change of band symmetry, and the relative motion is not 
permitted any additional angular momentum. For s states, the bound states have an amplitude 
given by the principal quantum number n and the Bohr radius ap: 
$,(0) = = (9.89) 
y Tan? 
For continuum states, with energy £, = k*/2u, the relative wave function at the origin is 
lPk (0) = 2xn/v[1 — exp(—2rn)], where 1/n = kao. Then (9.88) predicts that the 
absorption is a constant in frequency at the energy gap £,, and does not rise with a square 
root dependence on (fm — E,). The actual shape is shown in Fig. 9.7. A few sharp, distinct 
exciton lines are observed at low frequency which correspond to Is, 2s, etc. In actual 
experiments, these absorption bands are hard to see because they are too strong. Only in the 
thinnest crystals can enough light be transmitted through the experimental sample to measure 
the absorption rate at the bound states of the excitons. Usually the absorption is so strong that 
all the light is attenuated before transversing the sample. At higher frequencies, the ns states 


FIGURE 9.7 Typical absorption spectra of a direct gap semiconductor. Distinct exciton lines are labeled 1s, 2s, etc. 
The solid line is the theory with final state interactions, and the dashed line is the theory without them. 


596 Chap.9 e Optical Properties of Solids 


are closer in frequency and are broadened, so that their envelope becomes a continuous 
distribution which merges with the continuum absorption which starts at iw = E,. This 
experiment shows no anomaly at œ = E,, and the experimental value of E, is obtained only 
by extrapolating the Rydberg series for the positions of the ns exciton lines: 
ER 

oO, = Eg + E&E, = Eg — n2 (9.90) 
In most semiconductors, the Rydberg unit Ep is of the order of several millivolts because of 
the high values of the dielectric constant £ọ ~ 10 and the small values of the effective mass 
uw 0.1. 

This discussion is only a brief review of exciton theory in semiconductors. The interested 
reader is encouraged to seek additional information in the books by Knox (1963) and 
Reynolds and Collins (1981). The theory has been verified by many detailed experiments, 
such as exciton properties in static electric or magnetic fields, under stress, etc. Exciton effects 
are important for determining the absorption rate of photons within a frequency interval of Ep 
near the energy gap £,. 


9.2.2. Solution by Green’s Functions 


The Elliott formula (9.88) is derived using Green’s functions. The starting point is the 
Hamiltonian (9.86) which contains the properties of electrons, holes, and their mutual 
interaction. The Kubo formula is evaluated for the optical conductivity, which is proportional 
to A(w). The Elliott forumula is obtained by a summation of all the vertex diagrams of the 
current—current correlation function. The Green’s function analysis is straightforward and can 
be solved exactly, because it is still only a two-body problem. Indeed, this section could have 
been included in Chapter 4 on exactly solvable models. The motivation for this analysis is to 
stress the importance of final state interactions. The latter is a fancy name for the interactions 
between the particles in the final state of the optical transition. In this case they are the 
Coulomb interaction between the electron and hole. | 

The starting point for the Green’s function calculation is the Kubo formula for the 
current—current correlation function: 


1P, 
n(io) = — 5] dre" (T, j(t)j,,(0)) (9.91) 
0 
ja = E W(Ckod-x-o + Toke) (9.92) 
Oo 

m = | PPO, Dp, Y, r) = (e, klp, l», K) (9.93) 
The t development of the operators is governed by a Hamiltonian which contains the 
Coulomb interaction v, = —4ne*/&)q* between the electron (Cko, Ci) and the hole 

(Ao: dko) 
H =H} +V (9.94) 
Hy = DE lW)djdy + X EWICLC, (9.95) 


1 
V=- Do (x Aac) (x dia) (9.96) 
q k p 
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Spin does not play a role in this analysis, so the spin index is no longer written. The procedure 
is to solve the Kubo formula (9.92) with this Hamiltonian. The optical matrix element w, is 
taken here to be a function of k, although in the theory for Wannier excitons it is a constant. 
An important assumption, when evaluating this correlation function, is that the densities of 
electrons n, and holes n, are both negligibly small. An equivalent assumption is that the 
chemical potential u in the semiconductor is in the forbidden energy gap, somewhere between 
energy zero and E,. The chemical potential u, for the holes is the negative of that for the 


electrons u, = —He, and the energies in the preceding Hamiltonian are 
k2 
€.(k) = 2m, 1 8 — u (9.97) 
k? k2 
&a(k) = Z- — W, = Im, +H (9.98) 


The definitions are finished, and it is time to evaluate the correlation function. 
The electron-hole interaction V is treated as the perturbation for the Hamiltonian. The 
first term is that for no interaction, which is called n® (iœ): 


B 
rO (io) = — ; y va | de" (TC, (t)d_,(t)d" ,,(0)C} (0) (9.99) 
k 0 
l 2 p iO, T 
= ve > we dte ” G (Kk, 1)G,(—-k, T) (9.100) 
=- Dg YG .Ak, ik)G,(—k, io — ik) (9.101) 
k BE 
1 2 1 — npe) — nr (Čr) 
~V~ 0) Eb) 210% 


The last step involves a standard frequency summation, as given in Sec. 3.5. Both Green’s 
functions G, = 1/(ik — €,) and Y, = 1/(ik — &,) are for noninteracting particles, but the 
superscripts (0) are omitted since all the Green’s functions in this section are noninteracting. 
The occupation factors np are zero. The absorption A(œ) is the spectral function of this 
correlation function. Take the analytical continuation iœ —> œ + ið and define 


A(@) = —2 Im[r(%)] (9.103) 


3 
= wz8(o — E(k) — E(k) 


Pk k? 
= 27 ana a — E 1-5) (9.104) 


The result for AV (œ) is recognized as the one-particle theory of (9.79). The remaining terms 
in the expansion of the S matrix are now examined. Fortunately, most of them are zero. All 
self-energy terms, for either the electron or the hole, are zero. All self-energy terms for the 
electron or hole contain closed loops and vanish since they are proportional to np. The only 
nonzero contributions are vertex diagrams in the form of ladder diagrams. 

The method of summing the ladder diagrams follows Mahan (1967) and was used for 
impurity scattering in Sec. 4.1.5. They are shown as the diagrams in Fig. 9.8. The first vertex 
correction A“) contains a single ladder diagram, where the dashed interaction represents the 


AO) = —2 Im[x()] = 2n | 
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FIGURE 9.8 


Coulomb interaction between the electron and the hole. The two particles are represented by 
the solid line segments. The wiggly line at the vertex represents the photon. The lines for both 
the electron and hole are shown coming from the photon vertex and are traveling parallel in 
time. The diagram with n ladders is called A”. They can be constructed according to the rules 
for diagram construction. The first ladder diagrams gives 


1 l , , . 
nia) = ye > WkUk-kK Wk 5 2 G (k, ik)G,(—k, 10 — ic) 
kk’ i 
l 
x- EZ (k, ik)Z,(—k', io — ik’) (9.105) 
ik’ 


The two summations over Matsubara frequency are identical in form: 


1 — np (be) — np (6n) 

iœ — E(k) — &,(k) 
l 

~ ia — E, — k? /2u 


I 
LYG (k, ik)G,(—k, iw — ik) = 
ik 


1 J l 
(1) . _— , oO —?.h h_C.R.OO}O}OU 


(9.106) 


Additional ladder diagrams in A”) just produce additional factors of the Green’s function pair 
G.G,, which are summed over Matsubara frequency to produce another energy denominator 
(ia — E, — k? /2u). Each ladder also has another factor of the Coulomb potential Vk;—k; /v. For 
this problem it is easy to write the term with n ladder diagrams: 


1 Wyk, °° UK ok Wk 
i”) — > l n—1 n n , 
y”+l kk, ---k,, iO — Eg — k?/2u Tio — Eg — k? /2u 


The simple form of these terms permits them to be summed exactly to obtain the expression 
for the correlation function. The other vertex diagrams, which are not ladder diagrams but 
have a crossing of the interaction lines, are all proportional to the numbers of electrons or 
holes in the semiconductor. They can be neglected, and the only nonzero vertex corrections 
are the ladder diagrams. 

The summation of the ladder diagrams can be expressed as a vertex correction to the 
correlation function, which is written as 


, 1 wgl (k, iœ) 
Oym ly wk io) 9.107 
700) un v> wE, — k? /2u l ) 
, 1 Vk-k, 1 (k;, iœ) 
I (k, = -5 — 9.108 
(k, 70) Wet oE — 2p (9.108) 
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An exact summation of the ladder diagrams is obtained by solving for this vertex function. 
The integral equation (9.108) for the vertex function is an equation of the relative motion of 
the two particles. © 

The first step in the solution of the integral equation (9.108) is to define the auxiliary 
function 


eK "'T(k, io) 


] 
P(r, io) = -Yo 
(r, 70) vio —E, 2/2 


(9.109) 


For the Wannier exciton system, the matrix element w, is independent of k and is a constant 
wọ. In this case, the correlation function in (9.107) is the value of this auxiliary function at the 
origin: 


1, (i@) = WoP(r = 0, iœ) (9.110) 


The integral equation (9.108) for the vertex function becomes 
e . 
I'(k, io) = wọ — [erie "P(r, iœ) (9.111) 
0 


The auxiliary function P(r, iœ) is introduced so that one can write a differential equation for 
it, which is then solved to obtain the exact solution to the function. This differential equation 
is obtained by operating on both sides of (9.109) with the operator [iœ — E, + V7 /2u], which 
is chosen to eliminate the energy denominator on the right-hand side: 


Vv? lo x. io — E, — k’ /2u 
io — — io) = -Y` e™ "T (k, io) n5 .112 
(i E, + Pe iW) ye (k, iœ) ja E — 2p (9.112) 
_! e* TT (k, iœ) (9.113) 
Vk 
l ik - 3 E ikr , 
—— I r — 1 O l P f 
yee [ra | r ar (r, iœ) 
3 eo 
= w0 (r) — gr T i@) (9.114) 


The integral equation (9.111) for the vertex function was used to obtain a differential equation 
for the function P(r, iœ). The interaction term on the right is moved to the left-hand side of 
the equation, which derives the inhomogeneous differential equation: 


(io — E, — H.,)P(r, io) = wd? (r) (9.115) 
h? e 
H,, = -— V? -— . 
ox mY =r (9.116) 


This equation is solved in the following fashion. The factors iœ — E, on the left are constants, 
and the important operator parts are the same as found in the exciton Hamiltonian (9.86) for 
their relative motion: 


He. (r) = &;(r) (9.117) 
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The Hamiltonian H,, may be solved exactly in terms of hydrogenic eigenfunctions o,(r) and 
eigenvalues g;. In terms of these exact solutions, the solution to the differential equation 
(9.115) is written as 


(r)d; (0 
P(r, io) = wy PO (9.118) 
7 @—E,—& 
which is verified by examining the original equation: 

l , sn [O — E, — E; 

(iœ — Eg — Hax)P(r, iœ) = Wo 2 o(r)d; (0) in E, E 
= wo $ ,(1)b7(0) = woô’ (r) (9.119) 

j 


The last step is a summation over the complete set of hydrogenic states, which produces a 
delta function. This result agrees with (9.115), so that an exact solution is found for P(r, iœ). 
The current—current correlated function 1,(i@) is obtained from (9.110) as the value of 
Wo P(r = 0, iw): 


I; (O)|? 


in —E, — &, 


T (io) = wo > 


J 


(9.120) 


The optical absorption function A(œ) is the spectral function of this operator: 


Alo) = 2rw$ > \b,(0)|°8(@ — E, — £;) 
J 


This result is exactly the Elliott form in (9.88), with the transition rate depending on the 
square of the relative wave function evaluated at r= 0. As discussed in Sec. 8.2.1, this 
dependence causes a dramatic and important change in the absorption spectra. The final state 
interactions are very important. They are obtained by summing the ladder diagrams of the 
correlation function. It is an exact solution to the current—current correlation function (9.92) 
for the model Hamiltonian in (9.96) and with the restriction that there is a negligible density 
of electrons and holes in the semiconductor. 


9.2.3. Core-Level Spectra 


Core-level spectra in insulators is another problem which is similar, both physically and 
mathematically, to the Wannier exciton. The initial electronic state in the optical transition is a 
core level of the atom, such as the Is or 2s shell. The optical transition in the insulator takes 
the core electron to an unoccupied state in the conduction band. The empty core level can still 
be described as a hole, but now it is very localized in space and so has an infinite effective 
mass. The hole state has a constant energy &, = E, + p where E, is the core-level binding 
energy, as measured from the top of the valence band. The threshold for the optical transition, 
in the absence of any final state interactions, is the frequency @r = Es + Eo 

This problem can be solved exactly by the same techniques which were used for Wannier 
excitons. The advantage of the present model is that it is possible to explain the physics with 
greater clarity. From the point of view of the conduction electron, it has two possible potential 
functions. There is a potential for the initial state of the system, which is called V,(r). It is the 
potential acting on a conduction electron in the initial state, when there is no core hole. The 
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effective Hamiltonian for the conduction electron is called H, and its eigenfunctions are 
Y(k, r) with energy ¢,(k): 


AY; = -iv + Zoi r) = g;(k)¥;(k, r) (9.121) 


After the optical transition has occurred, the hole appears in the core state, which alters the 
potential of the conduction electron to V;(r). This final state potential is set equal to the initial 
state potential plus a term due to the core hole: 


Vp =V, +V, (9.122) 


2 
H¥, = |- fy + ZOZ r) = &-(k)P;(k, r) (9.123) 


The core hole potential V,(r) has the form of a Coulomb potential —e? /(eọr) at long range, 
but there are atomic effects at short range. The final state Hamiltonian H; has its own set of 
eigenfunctions ‘¥,(k, r) and eigenstates ¢-(k) which are different from those of the initial 
state. These states may be bound or unbound, to the core hole, and k is a general index which 
can represent either of these possibilities. The conduction electron has two possible sets of 
eigenstates which could enter into the calculation for the absorption. Which one should be 
used? 

The conventional approach to this problem is to expand the Hamiltonian and the current 
operator in the basis states of the initial Hamiltonian H;. A set of creation and destruction 
operators (C,,,, Cl) are associated with the basis set ¥,. The effective Hamiltonian is 


H =, did, + YE(W)CL Ck + VK KCL Cyesdidy 
Ss ks kk’ ss’ 
E(k) = €,(k) — p 
(9.124) 


V,(k, kK) = [rr r)V,)Y¥ (k, r) (9.125) 
This Hamiltonian has the same general form as (9.96). The last term in the Hamiltonian is the 
interaction between the conduction electron and the hole; the latter is represented by the 


operators (d,, di ). The current operator, for the Kubo formula (9.92), is also written in terms 
of these functions: 


ja = DwAk\(Ch d} + dCs) (9.126) 
ks 


w;(k) = [err rjé > po,(r) | (9.127) 


where @,(r) is the core wave function of the hole. 
The problem has the same form as the Wannier exciton which was solved previously and 
is treated similarly. The correlation function, from the Kubo formula, is evaluated as a set of 
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ladder diagrams. Their summation is achieved by a set of equations which are nearly identical 
to (9.107) and (9.108): 


1 — w,k)*T (k, io) 
=-->% 9.128 
nlio) V 3 iœ — Wr — £;(k) ( ) 
. V, (k, k,)I ;(k;, iœ) 
I .(k, = w.(k oe a an ae Sea 9.129 
The vertex function I’,(k, iœ) is solved by introducing the auxiliary function 
Y(k, r)I (k, io) 
P(r, =) 9.130 
mio) = FT op E 0.130) 
1 a 
T(i@) = E| drot - pP(r, iœ) (9.131) 
The equation for the vertex function is 
I ,(kK, io) = [örr r)[È - pd.(r) + V(r) P(r, iœ)] (9.132) 


The correlation function P(r, i@) is solved by writing it as a differential equation, which is 
found by first operating by 


(io — œr — H;)P(r, io) = >| Y,(k, r)T;(k, io) (9.133) 
k 
= E Yk, r) | PrP (k, rE « po) + VOP, io) 
k 


= (drS — rE + poe’) + VPC, fo) 
= È - po.(r) + V, (r^P(r, iœ) (9.134) 
where the completeness relation is 


> F (k, r) PF (k, r) = ô(r — r’). (9.135) 
k 


This relation produces the delta function which eliminates the integration over r’. The 
potential term is moved to the left of the equals sign, which makes the combination 
H; + V, = Hy. The auxiliary function P(r, iœ) obeys the equation 


(iœ — ©r — H;)P(r, iœ) = £ + po,(r) (9.136) 
This equation is solved in terms of the basis functions ‘F, of the final state Hamiltonian H;: 


P(r, io) = X FC r) fai (k, WE > PO) 


9.137 
k iœ — Wr — E-(K) ( ) 


An obvious notation is to define the optical matrix element in terms of the final state basis 
functions: 


w(K) = [erea r)é - pd,(r) (9.138) 
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The correlation function in (9.131) is 


„a_l [we (k) |? 
A(@) = = lwe(k)|78[o — Or — &/(k)] (9.140) 


The absorption spectrum is the spectral function of this correlation function. Equation (9.140) 
is actually a simple physical result. It shows that the absorption spectrum A(@) can be 
obtained from a one-particle spectrum by using the golden rule. However, one has to use the 
wave functions Y(k, r) and eigenvalues Er of the electron in the final state of the transition. 
During the optical transition, the effective potential of the system changes from JV; to Vy. 
Formula (9.140) shows that the optical spectrum is calculated using Vp. This conclusion is 
important, not only in solid-state spectra but also in atomic physics. 

In a single-particle picture, this conclusion is obvious. The core hole wave function ,(r) 
is also calculated for the potential V,, since in calculating such states one does not include the 
interaction of a particle with itself. That is, presumedly @, is found from a Schrödinger 
equation with a potential which also does not have the core level occupied. In a single-particle 
picture, this potential is V;. The final formula (9.140) merely describes a one-particle tran- 
sition between two different energy levels of the same Hamiltonian H+. It is an unremarkable 
application of the golden rule for transition rates. It is possible to solve this many-body 
problem because it is a one-body problem. 

Unfortunately the present model is too simple to apply accurately to real spectra. One of 
the assumptions is that the changeover from V; to V; occurs instantly with the creation of the 
hole. In actual systems, the other electrons and ions take time to adjust to the core hole 
potential. This time can be viewed as a frequency-dependent screening process. A realistic 
calculation should confront the dynamic screening, which was carefully neglected at the 
beginning of the calculation. 


9.3. X-RAY SPECTRA IN METALS 


9.3.1. Physical Model 


The previous section discussed exciton effects in the core-level spectra of semi- 
conductors or insulators. Now consider the same optical transitions in metals. The photons 
usually have higher energy, so the spectroscopy is labeled X-ray or soft X-ray. The many- 
body theory of this spectroscopy is considerably more complicated, since the conduction 
electrons in the metal respond dramatically to the X-ray transition. These processes cause 
several new effects which were predicted theoretically and verified experimentally. Among 
them are (1) the prediction of Mahan (1967) that the absorption edges had a power law 
divergence near threshold, which has been found for the p-shell spectra of simple metals such 
as sodium, magnesium, and potassium, and (2) the prediction of Doniach and Sunjic (1970) 
of asymmetries in the XPS (X-ray photoelectron spectroscopy) line shapes from core levels of 
these metals, which is also now well documented experimentally. The latter phenomenon is a 
consequence of (3) the orthogonality catastrophe, first explained by Anderson (1967). The 
general problem is often called “MND” after the work of Mahan (1967) and Nozieres and 
deDominicis (1969). 
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The model for free-electron metals is shown in Fig. 9.9. It is an energy-level diagram and 
shows the parabolic states in the conduction band, which is filled with electrons to the Fermi 
energy u. The horizontal lines on the bottom represent core levels in the ions in the metal; 
they are localized, have an infinite effective mass, and are drawn flat on an energy-level 
diagram. In the X-ray absorption process, the photon energy is used to lift an electron from a 
core level to an unoccupied state in the conduction band. At zero temperature, the only empty 
states are above the chemical potential u. In a one-election description of this process, the 
absorption must start at the threshold frequency Aw; = Ep + E, where E, is the core-level 
binding energy, as measured from the bottom of the conduction band, and Ep is the width of 
the occupied conduction band: 


3 
AO) = 2n |e (kia + ple)/PL1 — nr El ho — hor — Ey) (9.141) 


The factor [1 — np] is included to limit the final states to above the Fermi level. As indicated 
schematically in Fig. 9.9, the golden rule result (9.141) predicts that the absorption starts with 
a nonzero value at threshold, so there is a step in the spectrum. The step occurs because the 
electron gas has a nonzero density of states at the threshold frequency, since the electrons are 
going to states right above the Fermi energy. 

The core levels of ions have angular momentum as a good quantum number. The angular 
momentum of the electron state must change by one unit during the photon transition. If the 
core level has an initial angular momentum /, the final state in the conduction band is either 
1+ 1 or/—1. An atomic description of this absorption process would write the one-particle 
theory in terms of these two final state channels: 


3 
A (@) = 27 |e M10) + 4, (OIL — np EJ ho- horë) (9.142) 


where the constants A).,(k) are found from Clebsch-Gordan coefficients and radial integrals. 
In a metal, the final electron state is a Bloch function, which is not an eigenfunction of the 
angular momentum operator. The use of Bloch functions leads to a more complicated 
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FIGURE 9.9 X-ray transition between core levels and parabolic conduction band of a metal. Electron sates are 
occupied below Fermi energy which is E above the conduction band minimum. Absorption must remove a core 
electron and out it into an unoccupied band state, while emission must remove an electron from an occupied band 
state. Also shown are the predicted absorption and emission spectra in the one-body picture, neglecting energy band 
variations. 


Sec. 9.3 e X-ray Spectra in Metals 605 


description, since the final electron states in (9.141) must be summed over the full Brillouin 
zone and its various energy bands. 

The word hole has two different meanings in the present discussion. The first is a core 
hole, which is an electron missing from a core state. The other is a conduction band hole, 
where the electron is absent from an occupied state in the Fermi sea. Core hole is used for the 
first case and hole for the second. 

After the core hole has been created, it has several possible decay channels. The most 
likely is an Auger process, whereby an electron from a higher energy state falls into this core 
hole while its energy is transferred to another electron whose energy is correspondingly 
increased. The electron which gains energy can also be measured, and this Auger spectro- 
scopy 1s another important experiment (Gallon, 1978). 

The second decay channel is the emission of a photon by an electron which falls into the 
core hole. When the electron starts in the occupied states in the conduction band, this X-ray 
emission spectroscopy 1s an experiment which is complementary to the absorption. The 
emission provides a measurement over the states, below the Fermi level, which are occupied 
in the ground state of the Fermi sea. The absorption measures the unoccupied states above the 
Fermi energy. The emission spectrum also has a sharp step at the threshold frequency 
hOr = Ep + E., as shown schematically in Fig. 9.9. These steps, in emission and absorption, 
are called Fermi edges. 

There are several physical mechanisms which can change the shape of the spectrum, in 
emission or absorption, from the simple one-particle theory in (9.141). The first of these is 
due to the lifetime of the core hole, which is usually dominated by the Auger decay process as 
just described. This core hole lifetime is quite variable among the various core levels of an ion 
or atom. The core hole lifetime is longer for the outer valence shells than for the inner states 
closer to the nucleus. Some experimental values for core hole lifetimes in simple metals are 
shown in Table 9.1, from data of Citrin et al. (1977). The same group recently remeasured 
these values, Wertheim et al. (1992). The Auger lifetime is expressed as an energy uncertainty 
I’ = h/t (where I is the full width at half maximum). Values as high as several electron volts 
can be found for inner shells of atoms with high Atomic number. This Auger decay generally 
imparts a Lorentzian broadening to the X-ray spectra. It can be included, semiempirically, in 
the Kubo formula as a factor exp(—I'|t|/2). If the X-ray measurement is to determine other 
phenomena, it is desirable to keep I’, as small as possible, which is done by measuring the 
spectra of outer atomic shells. 


TABLE 9.1 Core level data at T = 300K. T is 
Auger width while y is phonon width. From 
Citrin et al. (1977) 


Metal State I (eV) y(eV) x 
Li ls 0.03 0.37 0.23 
Na 2p 0.02 0.18 0.20 
2s 0.28 0.20 0.20 
ls 0.28 0.18 0.21 
Mg 2p 0.03 0.18 0.13 
2s 0.46 0.19 0.13 
ls 0.35 0.20 0.15 
Al 2p 0.04 0.11 0.12 


2s 0.78 — 0.12 
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Another many-body effect is the broadening due to phonons. The only term which is 
usually kept in the Hamiltonian is the phonon coupling to the core hole, which is a term such 
as 


V=d'd 2 MDa ta!) (9.143) 
q 


It was shown in Sec. 4.3 that this type of phonon coupling to a localized level is a type of 
Hamiltonian which is exactly solvable. The optical absorption from such a coupled level was 
also solved, and the Kubo formula has a factor of exp[—(¢)], where 


2 


109 =H] Sg 0 -rot = e) +0 
+ [1 + itw,(q) — e” Jng(w, (q))} (9.144) 
xy, v= 22 IM (a) EL + 2ng(@,(q)] (9.145) 
q 


The term linear in ¢ is the self-energy term. The factor exp(—ọ) is added to the argument of 
the time integral in (8.3.3). Usually the Auger width T is large enough that only the short time 
response is needed of the correlation function. Anything that is going to happen must do so 
quickly, before the core hole decays by the Auger process. The short time limit of the phonon 
contribution is p ~ yz? as shown above. The spectral function is broadened by the Auger and 
phonon contributions 


OO 
Aa) =| HAO O expl FTN- 77) (9.146) 
—&OO 

The phonons add a Gaussian broadening to the spectral function for the X-ray process, where 
the Gaussian component y is temperature dependent. It is relatively easy to experimentally 
separate the phonons from the Auger width, which is a temperature-independent Lorentzian. 
The only remaining problem for the phonon contribution is to calculate the matrix elements 
M (q) for the coupling to the core hole. However, this calculation is quite delicate (Hedin and 
Rosengren, 1977). 

The phonon broadening for the emission spectra is a much more subtle calculation. Here 
the problem is that the phonon system does not equilibrate to the presence of the core hole 
before the emission occurs. One expects that the minimum time for the phonons to come to 
equilibrium around the new core hole, and its screening charge, is the inverse Debye 
frequency fp, ~ 1/@p. This time is generally much longer than the lifetime for the Auger 
effect, except for the outer shell of electrons, as shown in Table 9.1. The theory of phonon 
broadening of emission must account for the nonequilibrium states of the phonons. This 
theory was developed simultaneously by Mahan (1977) and Almbladh (1977). The most 
interesting case yet known is lithtum, for which phonons are the most important of all the 
mechanisms which broaden the edge. The theory predicts a double shoulder in the emission 
spectra because of incomplete relaxation of the phonons. The prediction was verified by the | 
experiments of Callcott and Arakawa (1977). 

These many-body processes, Auger and phonon, are reasonably straightforward. The 
more interesting aspects are the response of the many-electron system to the appearance of the 
core hole, which are discussed in the remaining parts of this section. 
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9.3.2. Edge Singularities 


Mahan (1967) predicted that the absorption edges for X-ray transitions in metals would 
have a power law divergence of the form 


A(@) = A (@) (=) (9.147) 


O— Or 


where &) ~ Ep is a bandwidth, and the exponent « is discussed below. His theory resulted 
from an investigation into the properties of excitons in metals. The Wannier picture applies 
equally well to metals, so that the optical transition should be viewed as the simultaneous 
creation of an electron and a core hole in the metal. These two particles will interact in the 
final state. Presumably this interaction is screened, although a detailed theory of the screening 
process is still an active subject of research (Canright, 1988). Mahan assumed that the 
electron core hole Coulomb potential is screened instantly at the time the core hole is created. 
This approximation may be valid near the absorption edges but surely is unreliable farther 
from threshold. 

The exciton theory predicts that the exponent « in (9.147) is positive, so the absorption 
diverges as a power law singularity at threshold. Later work has shown that the renormali- 
zation catastrophe of Anderson tends to make « negative. The final values of x may be either 
positive or negative, depending on which of these two factors is most important. Not all edges 
are singular. 

The calculation assumes there is instantaneous screening of the electron—hole interaction 
in the metal. The model Hamiltonian for the X-ray absorption process is: 


1 
H =€,d'd+ E(K)CLC, + =} Vk, k’)\C}Cyd'd (9.148) 
k kk’ 


The spin indices have been dropped from the notation. The last term is the screened Coulomb 
interaction between the electron and the core hole, while the first terms are Hy for the 
electrons and the core holes. The Hamiltonian is written in the initial state basis, so the 
electron operators (Cx, Ch) refer to the state which does not have a core hole. For the Wannier 
exciton theory it was shown that there is a great advantage to working in the final state basis 
set, but that is not the case here. The calculation is about equally difficult in either basis set, 
and the same answer is found from either starting point. The difference between the present 
calculation for the metal, and the earlier calculation for the insulator, is that there are now 
N ~ 107 conduction electrons which also interact with the core hole, and they are initially in 
states described by the initial state basis. In the calculation, it is now necessary to keep the 
terms n;-(€,) for the electron occupation number. They are the cause of the dramatic change in 
the theory, and the prediction (9.147) of the edge singularity. 

In the exciton calculation, the vertex diagrams to the correlation function are included by 
summing the set of diagrams in Fig. 9.10. To keep the physics simple, the calculation is first 
done by assuming some approximations which tend to reduce the number of superscripts and 
subscripts: (1) the interband matrix element ( f|€ + p|c) is taken to be a constant wọ; (2) the 
screened Coulomb interaction is taken to be a constant V(k, k^) = Vo up to a cutoff energy 
Éo œ% Ep which is a typical bandwidth. With these approximations, the ladder diagrams in the 
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FIGURE 9.10 
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vertex summation in Fig. 9.10 can be written as in the previous section: 


(ia) = > (ia) (9.149) 
n=0 


n+l 
n” (iw) — we(— Vo)" | d'k T aee eG (9.150) 


(2m)? im — wr — ők 


Removing the wave vector dependence of the scattering potential uncouples the wave vector 
integrals. One is left with a product of (n + 1) integrals which are identical. The integrand of 
the wave vector integral contains the factor 1 — n-(€,) — np(E;,), where the core hole term 
nr(G,) is set equal to zero. This step leaves 1 — n-(&,), which ensures that the electron 
scatters into unoccupied states. The summation over wave vectors can be changed approxi- 
mately to d'k > (20) NrdÉy, where Np is the density of states. The dé, integral is just a 
logarithm, and the ladder diagrams give the result at zero temperature 


Eo _ _j 
Ne| dé, EG _ y, In( 2) (9.151) 


—& 10 — Or — Ék 0 


. n+1 
nia) = —w2Np(—NpVo)" in (2 F 2) (9.152) 
0 


These approximations are fairly crude. They are meant only to apply to the threshold region, 
where œ © œp, so that the cutoff in the wave vector integration by the Fermi occupation 
factors np is most important. The vertex summation leads to a summation over logarithms of 
increasing powers. 

This series should not yet be summed, since other vertex diagrams contribute terms 
which also contribute to the series. The first diagram in the series, which is not a ladder, is that 
shown in Fig. 9.10(d). It contains six Green’s functions as internal lines. The following 
summations over Matsubara frequencies must be done: 


l 
BE pari Se Ek + q, ik + igA k +q + a ik + ig + ig’) 
ik,iq,iq' 
x G,(—k, iw — ik)G,(—k — q', iœ — ik — ig’) 
x G,(—k — q — q’, iw — ik — ig — ig’) (9.153) 


These summations are done in sequence. The core hole Green’s functions all have the same 
energy č, which is independent of wave vector. The conduction electron energies will be 
denoted by the symbols & = €,(k), & = ¿(k + q), E” = E(k + q + q’). When doing these 
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summations, the core hole occupation factor np(&,) is set to zero wherever it occurs. With 
these conventions, the three summations are, in sequence, 


l 
B> G(k+q+q)G,(iw — ik — ig’)G,(iw — ik — ig — iq’) 
ig’ 


_ 1 — n,-(&") 
(ie — &" — &, (iw + ig — &" — &,) 
1 GAk+Q _ nr(&) 
Bigiot+iq—f°-&, im—ik+6—-&"-&, 
1 G (k)G,(iw — ik) 1 — n,;(&) 


Big ia —ik +E -E -é (io -§ — &, io -E + & — E" = En) 


The vertex correction is obtained from the integral (€, = œr) 


So Eo Éo 
nDo) = VNO? | a| a| dE” 
—€ —§o —§o 
y [1 — nr Hnr E — nr) 
(iœ — é — wr)(im — E + & — &" — œr )lio — &" — @r) 


This integral is more difficult to evaluate than those in the previous cases, since the middle 
energy denominator contains all three conduction energies, —& + &’ — &”. The integral over 
dé’ is over the occupied states n,(&'). These terms could be ignored in the insulator where no 
is zero. They must be included in the metal where ny Æ 0. 

The triple integral in (9.154) is obviously complicated. The most divergent term it yields 


(9.154) 


is 


. 3 
reio) = tw Np (NF Vo) in (“= *)| (9.155) 
0 
This term is one-third the contribution of the double ladder diagram in Fig. 9.10(c) and with 
the opposite sign. 
A similar calculation can be done for the third order vertex corrections. Each of the six 
terms with three vertex lines in Fig. 9.10(e) produces terms which have {In[(@; — io)/ Eq} . 
The summation of all the terms shown in Fig. 9.10 gives the series 


News 


nlio) = —-— 2 (L- 1? +223 —1 14+...) (9.156) 
Np Vo 
L= N-V, in( PP 2) (9.157) 
0 


The series is now summed to produce the final result: 


nim) = -—-°-. (1 — e”) (9.158) 
0 


News Eo \ elo 
-- 74 |; - (<n) 0-199) 
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The optical absorption is the spectral function of this operator. Note that the right-hand side is 
real for œ < w7, but complex parts are generated when œ > @, 


A(@) = —2 Im[x(o)] 


N 2 2N; Vo 
— _ Ne iml (=) exp|—i21N; VOl — oni| 
T 


V5 sin(onN V) _ So eo — 0 ) (9.160) 
A F'0 O — Or) T 


Equation (9.160) is Mahan’s (1967) result that the absorption diverges as a power law, where 
the exponent is the dimensionless quantity x = 2N;-V>. The exponent is obtained by summing 
the most divergent diagrams in each order of perturbation theory. These terms are divergent, 
because the retarded function In[(@; — @)/E ] can become quite large near threshold œ ~ or. 
In the limit where Vọ = 0, the above formula goes to the noninteracting result that 
A = 2nN,-w2@(w — wr), and the absorption edge is a simple step. 

The correlation function (9.77) for the model Hamiltonian (9.148) can be evaluated 
accurately using simple analytical formulas. Pardee and Mahan (1973) showed that the edge 
singularities could be derived in a simple way using dispersion theory. Penn et al. (1981, 
called PGM) obtained an analytical solution to the multiple scattering problem, including all 
vertex corrections. They showed that the leading term in the series was the result of Pardee 
and Mahan. Ohtaka and Tanabe (1983, 1986) obtained a more elegant analytical solution, 
which they illustrated with numerical examples. This work has provided a complete solution 
to the MND problem, and in an analytical form that is useful for computation. 

The result of Pardee and Mahan is obtained by a simple argument. Let T(w) be the 
complex scattering amplitude for an ingoing wave of an electron scattering from a central 
potential, which in this case is the screened core hole. If 7’(@) is the amplitude of the 
outgoing scattered wave, they differ only by a phase factor which is twice the phase shift: 


T'(@) = T(@) exp[2i5,(@)O(@ — ©r)] (9.161) 


where / is the angular momentum of the outgoing electron. The step function is a reminder 
that the phase factor exists only in the region of absorption, since Tand T’ must both be real 
in regions where there is no absorption. The next assumption is that T and 7” both originate 
from the same analytical function of frequency: 


t(@ + id) = T'(w) (9.162) 
t(@ — ið) = T(@) (9.163) 
Combining these results shows that 
t(œ + ið) = exp[O(@ + i5)] = |T(@)| exp[+i5,(@)O(@ — w7)] (9.164) 
Im[O(@ + 15)] = £6,(@)O(@ — @7) (9.165) 


Dispersion theory assumes that (z) in (9.164) is an analytical function of complex frequency 
z. The only function that has the property (9.164) is 


1 botor 
| de 0/(€) 


%0) == » Enz (9.166) 


Sec. 9.3 e X-ray Spectra in Metals 611 


As usual &ọ is the bandwidth. The function o(z) is a dispersion integral. The integral can be 
evaluated by writing 6,(€) = 5,(@) + [ô (£) — 5,(@)] and the first term is an easy integral: 


dlo + 18) = ÈO D fin tore + inO(o — or)! +K(o) (9.167) 
_ ole) — ô (0) 
K(o) = z ifa ~; o (9. 168) 


The function K(œ) is a smooth function of œ since the integrand is real and not singular. The 
amplitude of the X-ray absorption is proportional to 7’(@) and the intensity is proportional to 
|7’(w)|?. This factor can be evaluated using the dispersion integrals 


an [28(@)/ 
IT (œ) = so FOr @ 2K(@) (9.169) 
Or —@ 
we E 28/7 
A(@) © snaa (> 0 =) ©O(o — @,) (9.170) 


This equation has the same edge singularity as (9.160). Here the exponent of the power law is 
a = 26,()/n rather than 2NF Vo. Note that if one is calculating the phase shift in the Born 
approximation, then one gets that sin(6) ~ 6 © nNF- Vo. Nozieres and deDominicis (1969) 
were the first to realize that the exponents of the edge singularities were functions of the phase 
shifts. At the threshold frequency œ ~ œp then the exponent depends upon 6,(@,), which are 
the phase shifts of conduction electrons at the Fermi surface for scattering from the core hole. 
The factor of K(@) is a smooth function of frequency and does not contribute to the singular 
behavior. It has an influence upon the calculation of the X-ray absorption for other values of 
frequency. 

In X-ray absorption an electron absorbs an X-ray, departs the atomic core, and becomes 
a conduction electron. The core potential changes from having a charge of Z to having a 
charge of Z + 1. All of the other conduction electrons have their central potential suddenly 
changed by the appearance of this new core hole. The switching-on of the potential may cause 
electron—hole pairs to be created. The pairs act as bosons. The theory for this process can be 
treated by the independent Boson model of Sec. 4.3. 

The electron that departs the core state and becomes a conduction electron may also 
create excitations of electron-hole pairs. The departing electron has only a small probability 
of creating pairs. This process may be accurately calculated by perturbation theory. 

X-ray absorption has a number of factors that contribute to the shape of the spectra. The 
above discussion derived the contributions of phonons, Auger decay, and the final state 
Coulomb interactions. Another important contribution is the shakeup mentioned above. The 
shakeup contributes a time evolution p(t) which is calculated in the next section. All of these 
various factors contribute a time dependence to the evolution of the absorption. The 
experimental spectra are found by Fourier transforming the product of all of these factors 


A(@) = p dtp(t)A(t) exp[it(@ — wr) — TIt] — y77] (9.171) 


The factor A(¢) is the Fourier transform of A(@) in (9.170). Instead of transforming A(), an 
alternative is to represent (9.171) as a convolution integral. The evaluation of p(t) is presented 
in the next section. 
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9.3.3. Orthogonality Catastrophe 


The X-ray absorption creates a core hole in the midst of the electron gas. It also creates 
an additional conduction electron, but the latter process has been discussed in the previous 
section. Now consider the impact on the N ~ 10% other conduction electrons of this sudden 
appearance of a core hole. The core hole is again represented as a static potential V The 
physics is to investigate the response of the free-electron gas to the sudden appearance of a 
new potential V The result has been labeled a catastrophe because the transition is forbidden. 
The word forbidden is used in an unusual way, which is now explained. 

The important physics is that the ground state wave functions of the conduction electron 
system, with and without the core hole potential V are orthogonal. The transition is forbidden 
because it is the transition between two orthogonal states. The X-ray absorption is observed in 
actual metals, because the conduction electron system is excited by the creation of electron— 
hole pairs. The pair creation is a symmetry-breaking process, which allows the forbidden 
transition to be observed. The orthogonality was first suggested by Hopfield (1967, 1969) and 
proved by Anderson (1967). 

It is important to realize that the orthogonality between the two wave functions, with and 
without the core hole potential, is between the N-particle wave functions in the limit where 
N — œ. The single-particle wave functions are not orthogonal. As a simple example, assume 
that in the ground state of the electron gas, before the appearance of the core hole, the 
conduction electrons can be described by single-particle wave functions with no potential, 
such as plane waves. The derivation follows Anderson who considered only s waves, so that 
the wave functions have the form (kr) = sin(kr)/kr. In the Hartree-Fock approximation, the 
N-particle wave function can be described by an N-dimensional Slater determinant of these 
orbitals. The spin indices are omitted, so write the initial state of the system as 


(kyr) O(kyr2) = (ry) 


1 lkr) lkr) ++» ary) 


Pi r2 Tw) = Te (9.172) 


O(Ayr;) (kyra) ++: O(Ayry) 


After the appearance of the core hole potential, the conduction electrons adjust their indi- 
vidual wave functions to the presence of this potential. Far from the potential region, the wave 
functions can be described by '(kr) = sin(kr + 5)/kr, where 6 is the s-wave phase shift for 
one-electron scattering from the core hole potential. The new N-particle wave function is just 
a Slater determinant with these new orbitals 9’: 


p'(kiri) b'(k; ry) ree p (kiry) 
1 | O Cor) okr) +: pry) 
Ppt. ror) = Te . . . . 


(kyr) p (kyr) ++ b'(kyry) 


(9.173) 


During the X-ray transition, one takes the matrix element between the initial and final states of 
the system. In a one-particle picture, the matrix element | d°’r;(k,r)é - po,(r) is between the 
core wave function 6,(r) and a conduction electron o,(kr). The many-particle calculation 
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computes this matrix element between the N-particle states. It should include the overlap 
between these two Slater determinants, 


S = Jer .. Pry Oe (r soe ry)O(r; see ry) (9.174) 
which has been shown to be 
S = N? (9.175) 
52 
x = 2= (9.176) 
T 


The many-particle overlap S turns out to be a negative power of N, where the exponent is a 
function of the phase shifts. This overlap vanishes in the limit where N — oo and is small for 
real systems which have N ~ 10? and « is typically 0.1-0.2. 

There is no orthogonality between the individual matrix elements of single-particle 
orbitals (kr) and o’(kr). This single integral is nonzero. When these overlaps are evaluated 
in the determinant for all the possible combinations, the result is asymptotically zero as 
N — oo. The analogous situation occurs in atoms but on a less drastic scale. For an atom 
with N electrons, the optical absorption can cause one electron to change from one energy 
state to another. The matrix element for this process is calculated for the full N-particle wave 
function of the atom. The N-particle matrix element can be reduced to a one-particle matrix 
element between the two primary states which are the initial and final levels of the electron, 
times a factor S which gives the overlap of the other N — 1 electrons. They each change their 
orbitals a small amount during this change of state by one electron, and the N — 1 overlap 
function has a typical value of 0.95. Then S is near unity, and the N-particle overlap is a small 
effect in atoms with a few electrons. The metal case is the extrapolation to N ~ 10% elec- 
trons, where the product of one-particle overlaps, each slightly less than unity, eventually 
produces a vanishing matrix element. 

It is necessary to turn these ideas into a dynamical theory of the absorption process. 
Nozieres and deDominicis (1969) started by examining the Green’s function for the core hole: 


G, = —i(Td(t)d" (0)) = iOO (|e dea" |) (9.177) 


They manipulate this correlation function to show that it can be treated as the many-electron 
response to the sudden switching on of the core hole, in the same way that the phonon 
response to sudden switching was studied in Sec. 4.3.5. The core hole creation operator d*|) 
acts on the ground state |) to create a single core hole. The operator exp(—itH) on this state is 
represented by the final state potential with the core hole: 


eH qt) = ertt (9.178) 


The core hole destruction operator d destroys the core hole state. The last time operator 
exp(itH) operates on a configuration with no core holes, so that it is H, for the ground state. 
The hole core Green’s function can be exactly given by 


G, (D = =i "r O(A (Jee "ry (9.179) 


The core hole operators (d, dt) are now omitted because they have done their work of 
determining the proper order of the Hamiltonian operators. There are several different ways to 
evaluate the time dependence of this function. One way is to recognize that the Hamiltonian 
H; = ł_; h; is a summation of the single-particle Hamiltonians for the individual conduction 
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electrons. The hole Green’s function can be expressed as a Slater determinant analogous to 
(9.166) for the absorption rate. In fact, it is just the function p(t) in (9.169) which is the matrix 
element over the ground state orbitals of exp(—ith,) 


p(t) = (lee |) = ele det[dyy Opp <x, (9.180) 
G,(t) = —iO(He~"”" p(t) (9.181) 
bpp (t) = (ple |p’) (9.182) 


The evaluation of this determinant is one possible way to find the core hole Green’s function. 
Methods of numerical evaluation have been discussed by Schénhammer and Gunnarsson 
(1978). 

A second way to find the core hole Green’s function is by a linked cluster expansion. The 
correlation function in (9.181) may be written as a time-ordered operator, which is evaluated 
by the techniques described in Sec. 3.6: 


p(t) = (r exp| i| ar, ref) = exp] F(0) (9.183) 
0 l=1 
qy f t 
F(t) = =>] dh- | dt; (TV (t,) --+ V(t) eonnecte (9.184) 
0 0 


The first term in the exponential resummation F’,(t) is linear in ¢, and is a self-energy term: 
f —it 
Fy) = i| da VE) = EE Vs, Wine (9.185) 
The exact self-energy of the core hole is known from Fumi’s theorem in Sec. 4.1.3 to be 


kr 
E; = _ 2S (27+ D| kdk6,(k) (9.186) 
m ] 0 


The factor V(k, k) is the first term in an expansion which should give Fumi’s result £; when 
all terms are summed. It seems reasonable to use Fumi’s result (9.186) for the self-energy, 
which includes all the terms linear in ¢ in higher order. The threshold energy for the X-ray 
transition was previously given as œr, which is the value in the one-electron approximation, 
which neglects the final state interactions. They provide a self-energy E; which further lowers 
the threshold by this amount. The new threshold energy in the interacting system is 
Õr =, + E;, which accounts for the increased binding of the core hole due to the polar- 
ization of the electron gas around it. 

The second term in the exponential series F(t) is the first term which is interesting, 
since it predicts an orthogonality catastrophe: 


(—i)° 
2 


F(t) = | an| anir reve (9.187) 


O 2 ft t 
= l dt, l dt © E Vík, k)V(k;, ky) 


0 0 k,k,o k;k,0’ 


x (T, C ot )Crzolti Cho C2) Cryo'(ta)) (9.188) 
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The operators in the brackets are paired, according to Wick’s theorem, into Green’s functions 
of time. The time integrals are then evaluated directly: 


2 pt t 
FQ = i) faf dt, X, V(ky, K2)V (k3, kı) 


k; k,o 


x G(k,, t — b)G(kK,, t — t) (9.189) 


1 
77 mal an| dh D Vk, k 7) 2 gilt 2h) 61) 


k, k,o 


x [0 — ty) — n llO — 4) — n] (9.190) 


_t 2| itn, nm(l-m) itl, ba) 
F(t) = z xe IV (k;, k2) p -E E- E6} (1 — ] 


The factor of } in front vanished, because each term in the final result appears twice. The 
summation over spin index o can be taken to produce another factor of 2 in front. The first 
term, which is linear in time, is dropped because it contributes only to the self-energy, which 
is already known from Fumi’s theorem. Consider the expression 


2 ne(Sx, yt — np (Ëk, )] 


it(Ex, —Sk,) 
Ek — En) -e ) 


F(t) = -= = |V(k,, ky)| 


This term does predict an orthogonality catastrophe and has been investigated extensively; see 
Minnhagen (1977) for an evaluation. Here the topic is discussed briefly. Rewrite this 
expression by assuming that the hole potential V(k, k’) depends only on the difference of its 
arguments V(k — k’) and change variables to k, = k, + q: 


F(t) = -| £ M R (L — e’) (9.191) 
Rw) = =E Var Maw (9.192) 

q 
Na 1) = =D nG — Me Črta + Er — Ese (9.193) 


The factor A(q, u) is recognized as the imaginary part of the polarization diagram P® (q, u) of 
a single-electron bubble. It was evaluated in Sec. 5.5: 


A(q, w) = —Im[P(q, u)] = u( 2) Ok - q) (9.194) 


The important feature of this result is its linear dependence on the frequency u, which is valid 
at small u. The wave vector integration over q can be done for realistic potentials V(q). A 
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crude approximation to the result is 2(Np A where Np = mkp/27? is the density of single 
spin states at the Fermi energy, while V, is the magnitude of the electron-hole potential. 


gu u<& 
R,(u) = 0 u>, (9.195) 
E m f dq 1 2 9 2 
=5] ag V Oke — a) ~ NEY.) (9.196) 


The upper limit of u is &ọ, which is determined by the range over which R,(u) is linear in u. 
This range is approximately E) ~ Ep, as shown in Sec. 5.5: 


Éo d , 
F,(t) =g | ~q — eit) (9.197) 


The remaining du integral is interesting. The integrand has the factor |, du/u which could 
diverge as In(0). In this case it does not diverge, since the other factor (1 — e~™) ~ iut in the 
limit where u — 0. The integral is finite. However, for large times (Egt >> 1) this cutoff of the 
logarithmic divergence happens at lower and lower frequencies. This dependence can be 
approximately expressed as 

Éo du 

Ft) © -e| P ~ —g ln(1 + it&) (9.198) 
1/it 


The logarithm has the correct two limits of i€,¢ at small time and as In(it&o) at large times. 
This approximation completes the evaluation of the renormalization factor: 


o(t) = exp[—itE, — gIn(1 + it&)] (9.199) 


Its Fourier transform determines the spectral function of the core hole: 


A,(@) = —2 ml | dte” a0] = 2Re| | areen o (9.200) 


—00 0 
The real part of the integral is the same as the integral plus its complex conjugate: 


OO OO 
A, = l dte ©- p(t) +] dte 0-1 p*(t) (9.201) 
0 0 


The variable change t —> —t in the second term, along with p*(—t) = p(t), brings us to the 
equivalent expression 


A, = | dte ©- p(t) (9.202) 
= cress (9.203) 
ee 4 (9.204) 
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FIGURE 9.11 Path of integration in evaluating (9.205). 


The integral is evaluated by a contour integration. Change the integration variable to z = pf. 
There is a branch point at z = i. The branch cut is drawn vertically from this point, up the 
imaginary axis, as shown in Fig. 9.11: 


A, = 


1 oO izQ 
| Á (9.205) 


— dt ————— 
Eo -œ (1 + iz)? 


For Q < 0, the integral (—0o, 00) is closed by a contour in the lower half plane (LHP). This 
closed contour encloses no poles or branch cuts and so gives zero and shows that A;(@) = 0 
for Q < 0. The other situation is Q > 0, and here the contour of integration is closed in 
the UHP, as shown in Fig. 9.11. The contour encloses no poles, so the contribution on the 
real axis (—0o, oo) is equal to that along the cut. For the integration along the branch cut, 
change variables to z=i(1+y). The denominator in (9.205) becomes (1 + iz)? = 
[(1—-G+y > =[-yP =  exp(ting), where the choice of (+) in the exponent depends on 
the side of the branch cut. 
The integrals along the branch cut give 


Q) [& , , 
A,(@) = = Se (eo — e'T8) (9.206) 


- eri — 2s) & 


bo are 
The shape of this spectral function is shown in Fig. 9.12. It is zero for Q < 0 and nonzero for 


Q > 0. It diverges as a power law for Q ~ 0t, with the exponent (1 — g). It is easy to cheek 
that the area under the spectral function is 21: 


(9.207) 


_ i do Alo ) = inte) py o| ine (9.208) 
l= snr) ots” TA — g)T(g) (9.209) 


The last equation is an sdemtty among gamma functions. 

The response of the electron gas has a dramatic effect upon the spectral function of the 
hole. The spectral function has a power law divergence Q'* near the threshold energy. In the 
absence of interactions (V = 0), the spectral function is a delta function A,(@) = 
2nd(@ — @,). As illustrated schematically in Fig. 9.13(a), the influence of interactions is to 
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FIGURE 9.12 Spectral function of the hole Green’s function. The orthogonality catastrophe causes the power law 
distribution and eliminates any delta function character. 


change the delta function into a power law divergence. All quasiparticle behavior, in the form 
of a remnant delta function, has vanished. 

In X-ray absorption, the unperturbed spectrum for V = 0 is a step ©(w — œp) rather than 
a delta function, which is illustrated in Fig. 9.13(b). How does this step become modified by 
the potential V in the orthogonality catastrophe? The answer is that the threshold becomes a 
power law ©? which goes to zero as Q — 0. The threshold behavior may be derived in the 
following way. The Fourier transform of a step function 0(@ — œp) is ¢7!: 


œ dt git(@—or) 


©O(o — 0r) = l (9.210) 


œ 2ni t— ið 


The orthogonality catastrophe modified this spectrum by the inclusion of the term p(¢) in the 
integrand. Call the resulting X-ray spectra X (œ): 


©% dt 1 eto 


Ln 211 
œ 2nit— 15 (1 + ité) (9.211) 


x(a) = | 


The easiest way to evaluate this integral is to note that its derivative is just the hole spectral 
function 


dX (@) _ l 


— 9.212 
do zpr) ( ) 
V#0 
V=0 
(a) — 
V=0 V#0 
(b) | —__- ( 
Q n 


FIGURE 9.13 The effects of the orthogonality catastrophe upon spectral distributions. (a) The delta function of a 
spectral function is changed to a power law divergence. (b) The step function in absorption is changed to a converging 
threshold. 
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This equation has the solution 


do 0) = 20) e (9.213) 


ros) eA) = Tey |, T 


The behavior near threshold may be deduced by examining the integral for very small values 
of Q. Then the slowly varying term exp(—’) can be ignored, and the integral gives 


xo) - 2 o jf d% ÆN) 


Tg) Jo @ E TA 8) (3.214) 


Equation (9.214) shows that the absorption behaves as 0% near threshold. The threshold has a 
converging behavior, with the absorption starting at zero when Q = 0. It is sketched in Fig. 
9.13(b). 

The absorption is zero at the threshold frequency Q = 0* because of the orthogonality 
catastrophe. The transition is not allowed because of orthogonality and has a zero probability. 
The absorption is nonzero for Q > 0 because of the creation of a number of electron-hole 
pairs in the system. This serves as a symmetry-breaking process, which makes the transition 
allowed. 

The role of electron-hole pairs in the X-ray transition was elucidated by Schotte and 
Schotte (1969). The interaction term between the core hole and the electron gas can be written 
in terms of the electron density operator 


pa) = > Chr gisCks (9.215) 


LD Ch Cedidy =F Viap(ad'd, (9.216) 
kk’ ss’ V qs’ 


The density operator can be represented by the boson operators for the electron-hole exci- 
tations of the electron gas p(q) > (6, + bla). The Hamiltonian between the core hole and 
the electron gas is transformed into a problem of a localized level and a boson system. This 
model Hamiltonian can be solved exactly as the independent boson model of Sec. 4.3. The 
solution is entirely analogous to the phonon core hole system. The response of the boson 
system to the sudden appearance of the core hole is described by including the factor of 
exp[—()] in the time integrals The factor o(¢) for phonons is presented in (9.144) and is 
rewritten as 


DO = r+ |” RGD + A = eM) + na = e) 


Rp, (u) = 2 \M,.(q)|"5[u — ñola) (9.217) 
q 


The self-energy term iXt is put first, while the other term is the transient behavior. The 
approximate response of the electron system to the core hole was given earlier as F(t) in 


620 Chap.9 e Optical Properties of Solids 


(9.192). It has exactly the same form as the preceding, except the phonon system is at nonzero 
temperature. The electron—electron response at nonzero temperature is 


°° d . . 
O= -PO | TERO +10 — e) + nG) = 9) 
1f @& 
Rew) =- l Ses May? mPa 1) (9.218) 


The quantity R,,(u) for the phonon system is the phonon density of states, which is weighted 
by the matrix element for the process. Similarly, the factor R,(u) serves the same role for 
electron—electron interactions. It is interpreted as the effective coupling to the electron hole 
pairs. The equations are exactly alike for the response of the core hole phonon and core hole 
pairs, and enter into the spectral function for absorption in the same fashion as factors 
exp(—(f)). This similarity is reasonable, since both the phonons and electron-hole pairs are 
boson systems which respond in the same way to the appearance of the core hole. 

The only difference between these two boson systems is their respective coupling 
constants R,,,(u) and R,(u). This difference is big, since they behave quite differently at small 
values of u. The electron—electron interactions were shown to have a linear relation R, « u 
which leads to power law behavior in A(w). The phonon system is governed by the acoustic 
phonons at small values of u and usually behaves as Rp, « u? in metals. Consequently, the 
exponential factor for the phonon system does not diverge as ~ In(it&,) at large Eot but goes to 
a constant except for the self-energy term. The spectral function A(@) from the phonon part 
alone is a dull function, which was represented as a Gaussian. The core hole phonon and core 
pair couplings cause quite different behaviors in the time response of the system and affect the 
spectral shape differently. The difference arises from the dissimilar nature of their coupling 
function R,,,(u) and R,(u). 

The electron—electron response is an example of a type of phenomenon known as an 
infrared divergence. The divergence occurs in the limit where u — 0, which is the infrared 
part of the spectrum. The quantity R(u)/u? is the probability (at T = 0) of emitting an 
excitation of energy u during the sudden switching on of the core bole potential. For example, 
the number Nu of excitations created is 


oo Éo 
Nu = l M R (u) ~ e| du = —g İn(0) (9.219) 
0 u o u 


The average number Nu of bosons around the particle was discussed earlier for polarons in 
Sec. 4.1. It is infinity for the electron-hole excitations, which shows that an infinite number 
are created. This infinity does not imply a physical disaster, since all measurable quantities are 
nonzero. For example, the energy which is released by the creation of these electron—hole 
pairs 1s 


°° du Šo 
AE = | zz Reluu ~ e| du = g&> (9.220) 
0 0 


which is nonzero. However, a divergence in the number Nu of excitations is usually an 
indication that the spectral function will have an unusual shape, since quasiparticle behavior is 
eliminated. A further discussion is given by Hopfield (1969) and Mahan (1974). The phonon 
system does not have an infrared divergence in the X-ray problem in metals, although it can in 
semiconductors where the piezoelectric interaction is not screened. 
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The exponential series for F(t) = >), F;(t) in (9.183) has been approximated by the term 
F(t) plus i£;t. Further terms in the series also contribute to the orthogonality catastrophe and 
should be summed in order to obtain an accurate description of the phenomenon. The exact 
method of summing all of these terms has been given by Mahan (1982), and more elegantly 
by Ohtaka and Tanabe (1983). They provide an exact expression for R,(u). 

Several other types of results have been obtained. The most important is by Nozieres and 
deDominicis (1969), who show that the exact coefficient of the term which diverges like 


In(it&,) is 


lim F(t) = —itE; + « In(ito) (9.221) 
2 
a= 5 Ea (9.222) 
m,m, l 


The same phase shifts (kp) were mentioned earlier and arise from the scattering of the 
conduction electron. with wave vector kp from the core hole. The core hole is in the center of 
a spherical coordinate system, and the conduction electron states are described by the 
quantum numbers of m,, l, m; and k. The phase shifts are usually taken to be independent of 
m, and m,, in which case the coefficient becomes 


2 

a=2) (20+ yf (9.223) 
i 

The coefficient « should be used in describing the asymptotic behavior of the orthogonality 

catastrophe very near the threshold. 

The orthogonality catastrophe is an important feature of many-electron physics. The 
process of switching on a potential occurs in many circumstances, and the physics can be 
applied to a variety of phenomena. For example, the Kondo effect has this feature, which 
plays an important role in its theory. Whenever the potential is switched on, the system 
responds by making electron—hole pairs, which affects the spectral response of all correlation 
functions which are being evaluated. 


9.3.4. MND Theory 


The absorption of X-rays in a metal causes several different types of excitations and 
phenomena. Two have been discussed which cause power law behavior at the absorption 
edges: excitons and the orthogonality catastrophe. An important contribution was made by 
Nozieres and deDominicis (1969), who solved the two processes together. Their solution is 
asymptotically exact near threshold ©r = w, + E;. It expresses the power law exponents as 
functions of the angular momentum / of the conduction electron in its final state. The many- 
body correction to formula (9.142) with two angular momentum channels 4/4, is 


A(w) = 0(@ — @7) ais (; 2) an Arı (z =~) 7 


(9.224) 
„= 251) _, 
2 
a =2 F (21 + pé] (9.225) 
I 
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The power law exponents «, are the sum of two terms. The first is 26,/m, which is from the 
exciton phenomenon; this term is usually positive in free-electron metals and tends to make 
the edges diverge and become singular. The other factor « comes from the orthogonality 
catastrophe and tends to make the edges converging at threshold. The final value of «, for an 
absorption edge depends on the difference of these two quantities and may be either positive 
or negative. 

There have been many calculations of these phase shifts and exponents for comparison 
with those deduced from experiments. Typical results for sodium are shown in Table 9.2, as 
calculated by Minnhagen (1977). A recent calculation is by Ohmura and Ogiwara (1994). 
Minnhagen used a pseudopotential, which was screened by a Singwi-Sjélander dielectric 
function. The phase shifts should always obey the Friedel sum rule in Sec. 4.1, 


l= SD + 1)5;(kp) (9.226) 
l 


which is just a statement that the screening around the core hole must be one unit of charge. 
The exponents are typical for simple metals. The X-ray absorption from a p shell, where the 
initial angular momentum is / = 1, has the final values for the conduction electron of / = 0 
and / = 2. The asymptotic limit of the Mahan—Nozieres-deDominicis (MND) theory for 


© > Wr is 
A(@) = Ko — Or) a e 2a) +4 (s g) | 


The first term is the s-wave channel. The exponent &ọ is positive, so this term diverges at 
threshold. The second is the d-wave channel where «, < 0 and the term goes to zero at 
threshold. The summation of these two terms diverges at threshold. The theory predicts that 
absorption edges are divergent for electrons from p shells, which is generally observed in 
free-electron metals such as Na, Mg, Al, and K. On the other hand, if the initial core state is 
an s shell with / = 0, the final state of the conduction electron has / = 1, a, œ~ —0.05 for 
sodium. This value is so small that it predicts there is no measurable effect from the many- 
electron response. This prediction is also in accord with experimental observations. The value 
of xı is small because of the partial cancellation of 26, /m = 0.14 and « = 0.19. It seems to be 
a general feature of free-electron metals that the transitions from core hole s states shows no 
additional peaking or rounding of the threshold, in agreement with the predictions that «, is 
very small. MND theory seems to qualitatively explain the behavior of the edges of simple 
metals. 

Figure 9.14 shows the experimental results of Callcott et al. (1978) for the absorption 
edge of the L, , shell of metallic sodium. The p-shell is split by the spin-orbit interaction into 
the p37. and pı; components, which comprise the L; and L, shells. In sodium, this splitting is 


TABLE 9.2 Phase shifts and exponents for 
sodium. « = 0.19. From Minnhagen (1977) 
(used with permission) 


l 8) Q 

0 0.90 0.38 
1 0.22 —0.05 
2 0.01 —0.19 
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0.17 eV and accounts for the double edge shown in absorption. Each of these two edges 
should have a power law singularity. Figure 9.14 also shows the deconvolution of the sodium 
spectrum into the part due to the one-particle spectrum and broadening processes such as the 
Lorentzian width from Auger and the Gaussian width from phonons. The absorption edges 
show definite spikes which are from the edge singularities. The spikes are much smaller in 
emission, although that is partly from the increased phonon width in emission due to the 
incomplete relaxation phenomenon, as mentioned earlier. The value of the edge singularity 
exponent deduced by Callcott et al. to fit their absorption data is % = 0.25, which is smaller 
than the «) = 0.38 predicted from the phase-shift calculations. The theoretical values of a 
have been computed by many different workers, and values similar to those shown in Table 
9.2 are obtained by many different groups, so the theoretical values seem reliable. 

There have been several explanations for this lack of detailed agreement between the 
measured and theoretical exponents: (1) omission of frequency dependent screening in the 
theory; (2) exchange type of effects between the core hole spin and electron spin, as discussed 
by Girvin and Hopfield (1976); (3) scattering of the core hole among its different orbital states 
in the L shell; and (4) the asymptotic solution (9.217) is not valid very far from threshold. 
This subject is still an active area of research. A summary of the present status of this field 
seems to be that the edge singularities are observed in some thresholds of simple metals, and 
the refinements of the theory are still under investigation. 
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FIGURE 9.14 The X-ray absorption edge of the L, , shell of metallic sodium. The data is shown in part (b). Part (a) 
shows how the band structure prediction of Gupta and Freeman (1976) is convoluted with the edge singularity 
(x = 0.25) and broadening functions to fit the spectra. Source: Callcott et al. (1978) (used with permission). 
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9.3.5. XPS Spectra 


X-ray photoelectron spectroscopy (XPS) measures the line shapes of photo-electrons 
excited from the core levels of atoms. A photon, which is usually in the kilovolt energy range, 
excites a core electron to very high kinetic energy. Since the binding energies Eg of the L- 
shell electrons in Na, Mg, and Al are only 50—100 eV, the final kinetic energy of the election is 
E; = hw — Ep. Ina one-electron model, all the photoelectrons excited by the same frequency 
œ would have exactly the energy Ep, and a measurement of the distribution of kinetic energies 
of the photoelectrons would show a delta function at this value. The actual kinetic energies of 
photoelectrons, measured from simple metals, show the main peak at energy E ~ E; plus 
satellite peaks which correspond to the emission of bulk and surface plasmons (Pardee et al., 
1975). The main peak at E ~ E, has a nonzero width, which must be partly caused by the 
usual mechanisms of (1) Auger lifetime of the core hole and (2) phonon broadening due to the 
coupling between the core hole and the phonons. 

Doniach and Sunjic (1970) predicted that the orthogonality catastrophe should make the 
shape of the main line, at E ~ Ep, asymmetric with an asymmetry parameter which is a direct 
measurement of the index «. Their argument is that this experiment directly measures the 
spectral function of the core hole, since the outgoing electron has too much energy to be 
affected by the interaction with phonons or electron-hole pairs. In the calculation of the Kubo 
formula for the absorption rate, the conduction electron operators [Cho Cpo] can be removed 
from the correlation function on the grounds that the electrons leave the core hole too rapidly 
to be influenced by exciton or other final state processes. The correlation function is factored: 


Alo) = f dte” IAIO) (9.227) 
- Al dte™ Y w(k)*w(k')(C, (ACE ,(0)) (d,(t)d4(0)) 
—00 kk’ ss’ 
— =| dt Y` eb wk} (dy (d1 (0) (9.228) 
© J—oo k,s’ 


The Kubo formula for the absorption becomes proportional to the spectral function 
A,(@ — &,) for the core hole. In this experiment, the photon energy œ is usually fixed, and 
measurement is made of the distribution P(E) of final kinetic energies E = €, — Ew of the 
conduction electron, where Ey is the work function of the metal. XPS spectra can now be 
measured with millivolt energy resolution, although the electron energy EF is often over a 
kilovolt. The matrix elements w(k) are a constant over the narrow energy range of €, which 
are measured for each XPS line. The probability P(E) is just the spectral function of the core 
hole A,(w — E — Ey) 


P(E) = wf (E — Ep) (9.229) 
f(b) = | dt exp[—itE — 51 |t| — yY — aln(it&))] (9.230) 


where E; = Aw — Ey. It has the power law singularity in (9.207) and is illustrated in Fig. 
9.12, which is then broadened by the phonons and the Auger decay. The line shape is 
predicted to be asymmetric because of the orthogonality catastrophe, and the degree of 
asymmetry is just given by the index «. The formula used for interpreting the experimental 
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line-shape function f(E) contains the Lorentzian width from Auger processes, the Gaussian 
width from phonons, and the asymmetry from the electron-hole pair creation. 

An early experimental result is shown in Fig. 9.15 from data of Citrin (1973) for metallic 
sodium. The dashed line indicates the excess line shape on the low-energy side. Many more 
spectra have been published by Citrin et al. (1977). One feature of the experimental results is 
that the same index « fits the asymmetry for different core holes in the same atom: the same « 
is found for the different shells K, L,, L}, etc. The ion with a core hole looks the same, to 
electron-hole pairs which are outside the ion, regardless of the particular state of the core 
hole. 

The XPS line shape is asymmetric, because of the emission of electron-hole pairs during 
the creation of the core hole. The photon energy is fixed at Aw, and any energy used to make 
the pairs must be subtracted from that carried away by. the conduction electron. The 
conduction electron has a high probability of having energy less than Ep. This explanation 
accounts for the sign of the asymmetry and explains why the line tails to the left in Fig. 9.15. 
These asymmetric line shapes had been observed for many years, as reviewed by Parratt 
(1959). He speculated then that the “excess width on the low energy side is essentially 
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FIGURE 9.15 X-ray photoelectron spectra from sodium 2s electrons in (a) metal and (b) hydroxide. The metal 
spectra is asymmetric, with additional contributions on the low-energy side due to pair production. These are lacking 
in the insulator. Peaks to the left are plasmons in the metal. Source: Citrin (1973) (used with permission). 
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attributable to transitions between excitation states of the valence-electron-configuration 
type.” The quantitative theory of Doniach and Sunjic has verified this hypothesis in great 
detail and provides the most direct method of measuring the orthogonality catastrophe. 


PROBLEMS 


1. Derive the nonzero temperature form of the free-polaron absorption (9.51) assuming Maxwell- 
Boltzmann statistics. To order «, an exact result can be obtained in terms of Bessel functions 
K,(5 Blo + wl). 


2. Consider the force—force correlation function for scattering from an impurity. Discuss whether the 
multiple. scattering from an impurity can be represented by a T matrix or similar series. 


3. Derive the Wilson—Butcher formula (9.64) from the golden rule. Find the matrix element ( /|€ ° pli) 
by writing both initial |i) and final |f} Bloch states to first order in the potential Vg. 


4. Consider a Hamiltonian which is the homogeneous electron gas plus the crystal potential 
X Vgp(G). Discuss the summation of terms which occurs in higher order when evaluating the 
force—force correlation function. 


5. Where do the final state interactions for the Coulomb scattering of the electron and hole appear in 
formula (9.64) for interband transitions? 


6. Show that the rate of photon emission in an insulator with a nonequilibrium distribution of electrons 
and holes is governed by the matrix element lw(k)I7 as in (9.132). Assume the initial state of the system 
at zero temperature is |k) = d‘ a} 0), and use arguments analogous to those following (9.139). 


7. Write out the correlation function nm) in (9.105), with one vertex diagram, for the frequency- 
dependent screening function (7.171). Do the Matsubara summations. 


8. Show there is an infrared divergence in the X-ray response resulting from the piezoelectric electron— 
phonon interaction in insulators (Sec. 1.3) when used in the response function (9.217). 


9. Consider the X-ray edge problem with an interaction term H,, in Sec. 1.4.2 between the conduction 


electrons and the core hole. What is the contribution of this term to the orthogonality index « from the 
cumulant F(t) (Girvin and Hopfield, 1976)? 


10. Show that the imaginary part of the retarded correlation function (8.34) has the following sum rule 
relating to the average kinetic energy: 


o q e 
i z- n(o) = T Exe (9.231) 


Chapter 10 
Superconductivity 


The theory of superconductivity was formulated by Bardeen, Cooper, and Schrieffer (1957) 
and is called the BCS theory. It very successfully describes the superconducting properties of 
weak superconductors, such as aluminum, which are weak because of the small strength of 
the electron-phonon interaction. Further refinements of the theory have led to the strong 
coupling theory of Eliashberg (1960) which describes well the properties of superconductors 
such as lead. The distinction between aluminum and lead is roughly determined by the value 
of the electron-phonon mass enhancement factor à, as shown by McMillan (1968). The BCS 
theory is discussed first. It must rank as one of the great successes of many-body formalism, 
since the theory provides detailed agreement with experiments. This agreement is a refreshing 
change from most comparisons between many-body theory and experiment, where the results 
often depend upon vertex corrections, correlations, and computer simulations. The beauty of 
BCS is that it is, mathematically, a simple theory which is exceedingly accurate. The reason it 
works is that the basic coupling forces are weak, and mean field works well. 

The basic idea of BCS theory is that the electrons in the metal form bound pairs. Not all 
electrons do this, but only those within a Debye energy of the Fermi surface. The bound states 
of the electron pairs are not described by simple orbitals such as used for the hydrogen atom 
or positronium. The pair state, and the entire ground state of the superconductor, requires a 
many-body description. 

The Debye energy enters into the ground state description because the attractive forces 
between electrons, which are responsible for the pair binding, are due to the electron-phonon 
interaction. Fröhlich (1950) was the first to realize that electrons could interact by exchanging 
phonons and that this interaction could be attractive. He was the first to suggest that super- 
conductivity was caused by the electron—phonon interaction. The phonon dependence would 
explain the experimental observation that the transition temperature T, is a function of the ion 
mass for different isotopes of the same metal. This isotope effect was discovered for the metal 
Hg by Maxwell (1950) and Reynolds et al. (1950), where the dependence was proportional to 
AT./T. ~ —bM/(2M). The BCS theory explains this in detail, since it shows that the tran- 
sition temperature is proportional to the Debye frequency kT, ~ ©p x M~'/*. The isotope 
effect verified the Frohlich hypothesis that the electron-phonon interaction caused super- 
conductivity. 

Another piece in the theoretical puzzle was supplied by Schafroth (1955), who showed 
that a charged boson gas, when undergoing a Bose—Einstein condensation, would exhibit 
many of the superconducting properties known at that time such as the Meissner effect. His 
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theory does not explain other features, such as the energy gap in the excitation spectrum. 
Schafroth speculated that in the superconductor the “bosons are resonant two electron states.” 
The BCS theory uses a similar mechanism, since the paired electrons behave, in some 
respects, as bosons. The BCS theory was not conceived in a vacuum but among many related 
ideas, each of which contained some element of truth. 

BCS was the first theory to explain superconductivity in metals and also made a number 
of remarkable predictions. The foremost was that an energy gap existed in the excitation 
spectrum of the superconductor. The actual observation of this energy gap by electron 
tunneling (Giaever, 1960) provided a dramatic verification of the theory, although BCS earlier 
argued that the thermodynamic data supported the existence of a gap. Many different 
experiments in weak superconductors have shown that the original version of the theory is 
correct in its many details. An extensive comparison between theory and experiment is 
provided in the two volumes of Superconductivity edited by R.D. Parks (1969) and in the 
books by Rickayzen (1965) and Schrieffer (1964). 

In 1987 superconductivity was discovered in materials with planes of copper oxide, 
which is now the field of high-temperature superconductivity. The highest value of the 
transition temperature at ambient pressure is T, = 133K in the Hg—Ba—Ca—Cu—O system 
(Schilling et al., 1993). Although much is known about these materials, the basic mechanism 
causing superconductivity at high temperatures is still unexplained. Even the symmetry of the 
energy gap is uncertain. Most experiments support the d-wave picture, but some still support 
S-wave symmetry. The electron seemed to be paired in spin singlet states, with a very short 
coherence length. It is also interesting that experiments which were important in under- 
standing elemental superconductors, such as electron tunneling or infrared absorption, proved 
to be difficult to interpret for the cuprates. 


10.1. COOPER INSTABILITY 


The first inkling of the BCS theory was a letter by Cooper (1956), who pointed out that 
the ground state of a normal metal was unstable at zero temperature. A normal metal is 
defined as one which is neither superconducting nor magnetic. The instability is an indication 
that the metal prefers to be in another state, in this case the superconducting one. The 
demonstration of an instability does not provide a description of the superconducting state, 
but it did suggest that the instability was caused by the scattering between pairs of electrons, 
where the scattering potential is the exchange of phonons. 

Two electrons can scatter as shown in Fig. 10.1. The screened interaction between two 
electrons was derived in Sec. 6.4: 


2 
20A Ma 


_ 4 “a 
sh) = o) * a A 


(10.1) 


It has two terms. The first is the screened Coulomb interaction. The theory of super- 
conductivity is applied at low temperatures, where the energy exchanged between particles, 
while scattering, is also low. Requirements of crystal stability require that this interaction be 
repulsive at zero frequency (Allen et al., 1988). There have been numerous attempts to bind 
two electrons using just the first term which has only an electron—electron interaction. Then 
the plasmon is the boson which is exchanged between electrons. Careful calculations, 
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(a) (b) 


FIGURE 10.1 The pairing force in a superconductor is due to the exchange of excitations, such as phonons, between 
pairs of electrons. This interaction is frequency dependent and has attractive regions for nonzero frequencies. 


including local field corrections, show that the binding force from plasmon exchange is 
negligibly small. 

The second term in (10.1) is the screened electron—phonon interaction. It is, on the 
average, weaker than the repulsive Coulomb interaction. However, for frequencies near to the 
Debye (© < @p) the energy denominator becomes small and negative, which causes a 
relatively large interaction over this narrow range of frequency. The frequency dependence of 
V (q, œ) is illustrated schematically in Fig. 10.1(b). It may be possible for two electrons to 
bind if they can construct a relative wave function which selectively uses the frequency region 
which is attractive. Not all metals are superconductivity; the alkali metals are not. They have a 
weak electron—phonon interaction. So the attractive interaction needs some minimum strength 
in order to cause superconductivity. 

There are two ways to proceed with the theory. One is to use the full frequency- 
dependent interaction, which sums over all phonon wave vectors and modes. This realistic 
calculation is done in a later section. The other possibility is to replace (10.1) with a model 
interaction of the form 


—Vo for |,| < @p 


V.(q, ©) = | 0 (10.2) 


for |,| > Op 


This potential is constant and attractive (Vo > 0) up to a cutoff energy which is of the order of 
the Debye energy œp of the solid. This second form of the interaction permits a much simpler 
discussion of the theory and allows the physics to be introduced more easily. The initial 
discussion of superconductivity uses the model potential in (10.2), which follows the 
historical pathway of Cooper and BCS. Later, when the physics is better understood by the 
reader, the theory is redone from the beginning using the realistic potential (10.1). The latter 
is called strong-coupling theory. 

Cooper’s model of a normal metal at low temperature was a free-electron system. In the 
limit of zero temperature, the Fermi surface has a sharp step in energy. The electrons are 
allowed to have a weak attractive interaction as given in (10.2). Consider the mutual scattering 
of two electrons. Assume they initially have states of equal and opposite momentum k and 
—k, or zero center of mass. Why this is necessary for the Cooper instability is shown later. It 
is also assumed the particles have opposite spin states ¢ and |, so that exchange scattering 
does not occur. The interaction potential does not flip the electron spin and the spin states are 
preserved in the scattering process. 

Figure 10.2 shows a double scattering event between two electron lines which are 
moving in the same direction in time. This process is the scattering in the second Born 
approximation, where the first Born approximation is shown in Fig. 10.1(a). Each dashed line 
represents an interaction of the type shown in (10.2). If two electrons initially start in opposite 
momentum states k and —k, then a momentum transfer q leaves them still in opposite 
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FIGURE 10.2 
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momentum states k + q and —(k + q). This process is shown in the figure as the pairs 
(k,, —k;) in the intermediate state and (k’, —k’) in the final state of the double scattering. The 
effective scattering in the first and second Born approximation is 


Pky Vik —k)V(k — K) 
(2n)° 26k — 2&,, 


x {[1 — nrk M — nr Ék) - ne(Ex,)*} (10.3) 


The second term on the right is the contribution of Fig. 10.2. The energy denominator 
contains the initial state energy ¢, + -k = 2&, minus the intermediate state energy 2), . The 
numerator contains the important factors [1 — nr(&, yy —np(E&,) = = | — 2np(&,) which 
occur because the two particles, which are being scattered, can only go into the states (k,, t) 
and (—k,, 4) if they are not already occupied by an electron. This explains the factors 
[1 — ne(Ex N. The other term nrl) represents the scattering back into this state, 
(k,, 1) —> (k, 1), etc., since the result depends upon the net scattering. What is left are the 
remaining factors of 1 — 2n-(&,,). These occupation factors of n-(&,,) play a crucial role in 
the theory and are the cause of the instability. 

The integral in (10.3) may be evaluated. The key is that the interaction acts only over a 
small energy interval near the Fermi energy. Over this interval, of a Debye energy, the 
electron density of states in most metals is nearly constant. Change integration variables to 
fa?k,/(2n)° = f dé,N(E,) and treat N(E ~ 0) = Np as a constant. At zero temperature the 


result is (Sk, = 61,5 = É) 


Vlk — kK’) = Vik — k^) + | 


On —n 
Vere(k — k’) = Vk —K’) + Np n| ae, 2G) 
—Op 6— G1 
The integrand contains the factor 5 —n,(§,), and the 5 term may be ignored. It does not cause 
the singularity, and its inclusion leads to little effect. The term in n,(€,) leads to a logarithm 
which is singular at the Fermi energy: 


Op 0 
[i nge] = - nS) (10.5) 


The results of the first and second Born approximation may be summarized as 


(10.4) 


Voce = Bae —NrVo in( =.) (10.6) 
Op 


The term —Np Vo In(E/@p) is regarded as the vertex correction which results from the 
additional scattering between electrons. The scattering becomes very large for electrons near 
the Fermi energy |§/@p| « 1. Further insight is gained by considering the sum of diagrams in 
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FIGURE 10.3 


Fig. 10.3. Each additional interaction (dashed line) causes two more Green’s functions which 
are going parallel in the intermediate state. Each new set of intermediate states has the same 
type of integrand, so that a term with (n + 1) ladder diagrams gives a net contribution of 


-V |- V in( =) (10.7) 


Each term with n ladder diagrams has a factor of [-N; Vo In(E/@p)]", and the summation of 
these terms produces the series 


Vote = —Vo D -Nea in( =) | (10.8) 
N=0 Op 

—— A 

1+. NpVo In(E/@p) (10.2) 


This series can be summed to produce a net potential with an energy denominator (10.9). This 
energy denominator equals zero at 


Ey = Op expl- 1/NF Vo] (10.10) 
The energy denominator has a pole at this energy, since the denominator can be written as 


Np Voln(&6/6o). In the vicinity of &ọ the denominator can be approximated using 
E = Éo + (É — &) and the scattering potential has a pole: 


1 1 l l 
r= NEE Nal + E/E] 
~ o 
~ NE — bo) me 


This pole is sufficient to cause the instability. The electrons near the Fermi energy will interact 
with their pair on the opposite side of the Fermi sea. The mutual scattering produces a pole in 
the scattering amplitude, which will make the pair of electrons try to bind together. Of course, 
all electron pairs are doing this simultaneously, so that the entire metal undergoes a phase 
transition. The existence of this pole depended on the sharpness of the electron distribution 
n-(E,) = O(—E,). If all the electrons near the Fermi energy become paired, one must 
reconsider whether this sharp distribution still exists. A theory of superconductivity must self- 
consistently determine the properties of bound electron pairs. This derivation is the basis of 
the BCS theory. 

Another way to describe the instability is as a function of temperature. The calculation is 
repeated as a function of temperature. It enters into the electron distribution n,;(&,) by 
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changing the step ©(—€,) into a smooth function with an energy width of several kT. The 
influence of n-(§,) can be approximated by expressing the integral (10.5) as 


On 2 2 
dé, a ~ -pa er (10.12) 


Follow the same steps through the summation of all the ladder diagrams, and conclude that 
the effective potential now has an energy denominator of the form 
V 
Vg = -—— 2l (10.13) 
1 + Nro niy E? + (kT)? /@p] 


At zero energy (€ = 0), Ve becomes singular when the temperature is lowered to the critical 
temperature T,: 


kgT, = @p exp] — | (10.14) 


Nr Vo 


This argument correctly predicts the right form for the transition temperature of the BCS 
theory, which is 


kgT, = 1.140p apl -ay (10.15) 
The result is indeed proportional to the Debye energy, in agreement with the isotope effect. Of 
course, to explain the isotope effect, it must be shown that the exponent 1/(Np Vo) does not 
change with the ion mass. It actually does depend on the ion mass M, which is contained in 
the electron-phonon matrix element, in the interaction potential Vj. Some experimental 
values of T, for elemental superconductors are given in Table 10.1. 

The theory of the Cooper instability should be compared, for example, with the ordinary 
binding of two isolated particles. If the two particles are isolated, they do not have to obey the 
statistics of a collection of identical particles. Then the scattering theory does not contain any 
of the occupation factors; all states may be used as intermediate states since there are no other 
particles. In this case the multiple scattering theory was described earlier, in Sec. 9.2, in the 
theory of Wannier excitons. The multiple scattering theory may be described by a vertex 
function: 


d'k, Vik —k,)T(k,, k’) 


(2n)° 2ek — 25K, (10.16) 


T (k, k^) = V (k — k^) + | 


TABLE 10.1 Critical temperatures 
T, of elemental superconductors (K). 


Mg Al 
— 1.2 
Zn Ga 
0.91 1.1 
Cd In Sn 
0.56 3.37 3.73 
Hg Tl Pb 


4.16 2.38 7.22 
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The solution to this vertex function is equivalent to solving the two-particle Schrödinger 
equation in relative coordinates: 


| 

[SF + V9) + VE -rd -Eue =0 (10.17) 
The problem is factored into the relative and center of mass motions: 
r=", —F), Wry, r2) = e Pelr) (10.18) 
p? 
R = {(r; +r), E= +e (10.19) 
y2 

|- P + V(r) — J d(r) = 0 (10.20) 


The center of mass motion is plane wave, and the relative motion becomes a one-body 
problem. Without the occupation factors, the relative scattering of two particles by an 
instantaneous potential is a trivial problem. When bound states occur, they are at negative 
binding energy in the relative coordinates, i.e., at € < 0. This behavior is in great contrast to 
the Cooper instability, where the pole occurs at a small negative energy relative to the Fermi 
energy Ep, so the pole is at a positive energy Er — € . Two electrons cannot really bind at that 
energy, since their net energy is positive. The instability occurs because it appears to them as 
if they should bind, although if they tried, they would find they could not. The role of the 
occupation factors [1 — 2np(61)] in the argument of the scattering integral is what moved the 
apparent pole out to the Fermi energy. 

One can now see the reason the electrons must be paired with opposite momentum. The 
instability is caused by the sharpness of the Fermi surface, which is fixed in momentum space. 
Two electrons with arbitrary wave vectors k, and k, interact by exchanging phonons and this 
interaction can be attractive. Nevertheless, this does not lead to an instability when the center 
of mass motion is nonzero, since the zero energy of relative motion does not coincide with the 
location of the discontinuity in the momentum distribution. The center of mass transformation 


P=k, + k, k, =}P +k (10.21) 

k=3(k,—k,), k,=}P-k (10.22) 
k P 

Sk, + Čr, = m 4m 2Er (10.23) 


changes the relative scattering integral of two particles into 


[2 ro miea — tree (10.24) 


(2n)° k2/m — (k + q)*/m 


The energy denominator factors into just the difference of the relative energies, as it should, 
but the Fermi surface discontinuities are not at (k+ q) /m. Instead, they are at 
(+3P +k-+q)?/2m, which gets smeared with the averaging over angles. This smearing 
makes the pole disappear rapidly for nonzero center of mass momentum P. 

The same results can also be derived from a Green’s function analysis. If two particles 
interact with an instantaneous interaction, as in Fig. 10.2, and propagate parallel in time or t 
space, then the vertex has the combination of Green’s functions 


GK, t— TGC Kt — 1) (10.25) 
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The frequency spectrum is obtained by integrating this pair over (t — t’) 
Bo 
X(io) = | dte MgO, t — 1)GO(kK,, t — 7) (10.26) 
0 


This integral can be evaluated directly in t space by using the definitions of Y® in Sec. 3.2. 
Alternately, it can be converted to a frequency summation, which has the form 


1 
X(io) = 5 IK, ip)M (ky, iw — ip) (10.27) 
ip 


_ | — Nr(Sx, ) ~ nr(Sx,) 


io 8, E, (10.28) 


The expression contains one of the standard Matsubara summations which was given in Sec. 
3.5. It has exactly the combination of occupation number factors stated in (10.24). It also has 
the same energy denominators. The binding of particles is described by parallel motion in 
time. 

It is interesting to generalize the Cooper instability to other circumstances in which 
similar effects could occur. For example, consider the interaction between two different 
fermions, e.g., electrons and holes in a semimetal or semiconductor. They have the Fermi 
distributions n,(€,) and n,(&,), respectively. The scattering theory contains the vertex term for 
the scattering (k,, k,) — (k, + q, K; — q). 


d’ 1 —n,(k — np(k, — 
(27) Eek, + Ehk, — Ee,k,+q — Ek,k,—q 
This vertex would also cause Vg to be resonant if one had k, = —k,, but this cannot happen 


in a solid in equilibrium, since otherwise the electrons and holes would recombine. However, 
one can still get instabilities out of this process, as was originally discussed by Keldysh and 
Kopaev (1965). The instability leads to a state called the excitonic insulator, which has been 
reviewed by Halperin and Rice (1968). 

| A logarithmic singularity is obtained in (10.29) even if one of the occupation factors is 
Zero, Say ną, = 0, and only one particle obeys many-particle statistics. One example is the edge 
singularities in X-ray spectra first predicted by Mahan (1967) and discussed in Sec. 9.3.2. In 
electron—hole scattering, the electron occupation factors remain in the scattering function 
even if n, = 0 


(1 —n,)( —n,)— nean, =1—n,-—n, ¥1—n, (10.30) 


This behavior is in sharp contrast to the multiple scattering theory for an electron scattering 
from a fixed potential such as impurity scattering. Then there is only one fermion, say an 
electron, which has a factor (1 — ,) for scattering k — K’ and the factor (—n,) for the rate of 
back scattering k’ —> k. The back scattering enters with a sign change because of the anti- 
symmetry of the single fermion state. The two rates are added, 1 — n, + nę = 1, and the 
occupation number terms n, cancel out. The electron scattering from an impurity is treated as 
a one-body problems (except for self-consistent screening) even in a many-particle system. 
This cancellation explains why such factors as np(&) do not appear in the vertex equations for 
impurity scattering, which were solved in Sec. 8.1. 

Another possibility is to have one or both of the particles be bosons. Then the scattering 
probability is multiplied by [1 + ,(§,)] for scattering into state k, and by n ,(E,) for scat- 
tering out of it. For a single boson scattering from a fixed potential, the scattering integral has 
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the rate (1 + ng) for k — k, and the rate ng back, so the net is 1 + ng — ng = 1, which does 
not contain occupation factors. The potential scattering is again a one-body problem even in a 
many-particle system. However, two bosons which are mutually scattering have the factors 
(1 + ng) +n) — ngng = 1 +ng +n}, So that the scattering is enhanced because of the 
occupation of other particles. 


10.1.1. BCS Theory 


The basic feature of the BCS theory is that pairing occurs between electrons in states 
with opposite momentum and opposite spins, e.g., between states (k, 1) and (—k, |). The two 
spins are combined into a spin singlet, with S = 0. The singlet was chosen in BCS theory on 
the basis that the other choices of spin combination would lead to a triplet state with S = 1. 
The latter choice (S = 1) implies the superconducting state has magnetic properties, which in 
fact are absent for simple metals. The choice S = 0 seems most reasonable. Later work by 
Balian and Werthamer (1963), who solved the BCS equations for S = 1, showed that the 
triplet state had smaller binding energy and was therefore less favored. However, the recent 
theories of superfluidity in >He are based on the premise that the pairing occurs in the triplet 
state (see Chapter 11). Triplet pairing is possible and may exist in heavy fermion solids such 
as UPt, and UBe,, (Han et al., 1986). The discussion will assume that the spin arrangement is 
singlet. 

The pairing of electrons in the BCS theory must cause correlations in their relative 
motions. The pairing is described by introducing a new correlation function, similar to a 
Green’s function, for particles of opposite spin. Following Abrikosov et al. (1963), these are 


F(p, t— T) = —(T;Cpo C$ (T) 
F (p, T — T) = (T,C_py OC), (0)) (10.31) 
F'(p, t- T) = (TOC (0) 


The Green’s function 4 has the same definition as usual, although it has a different algebraic 
form in the superconducting state. The F and Ft functions are identically zero in the normal 
state. Even in the superconducting state, the bra (| and ket |) notation must have special 
meaning. For instance, in the definition of the F function, the ket |) is operated on by two 
destruction operators and then closed by (|. The state |) must have two more electrons than (|. 
A basic feature of the BCS ground state wave function is that it is composed of a super- 
position of electronic states containing a different number of electrons. Such a formulation is 
possible in a grand canonical ensemble. The procedure will be to find a self-consistent 
equation for the correlation function F or its Hermitian conjugate Ft. At high temperatures 
the only solution is F = Ft =0, but a nonzero solution becomes possible at low 
temperature. 

The order of the spin indices (+, |) is important in specifying the Green’s functions. 
Suppose by mistake for F one wrote 


(T,C_p (DC, (7) (10.32) 
Since F does not depend on the sign of p, this correlation function is the same as 


(T, Cy, (t)C_py (T) (10.33) 
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The Cpo operators anticommute, and doing this gives 
—(T,C_p(v)C pt (T)) (10.34) 


—F (p, T — t). These kinds of sign errors can be avoided by always following the definition 
closely. 

The objective is to provide the simplest possible derivation of the BCS theory. First 
assume a model Hamiltonian of the form 


H= > Epp oCpo F57 ve ACON p+q.s© i —q,s' Cp's' Cps (10.35) 


The interaction potential V (q) between electrons is taken to have the form in (10.2), which is 
an attractive constant V (q) = —Vo over a range of energies within a Debye energy of the 
Fermi surface. With this Hamiltonian, a set of self-consistent equations will be derived for the 
Green’s functions %, F, and Ft. The derivation is done using the equation of motion. 
Consider 


d 
q Crol?) = [H, Cpo] = —SpCpo = 7 LOG Cys Cp_a.c (10.36) 


From the definition of the t-ordered product, the first derivative of the equation for the 
Green’s function is: 


< ap, t—T)= -<10¢ —v)(Cyg(t)Ch, (T) — O — (CT (VC po(t))} (10.37) 


= —6(t — T) — (r È Cp) Che (t »} (10.38) 
Using the result (10.36) for dC,,,/dt gives 
(-F.-& )#@.t-2) E ro (10.39) 
ot V ap's 
x (TeCh_ ge Cpe ()Cp-4, CCT) = 8(t — 1’) (10.40) 


The bracket of four operators in the interaction term must be reduced to products of pair 
operators. There are many ways of doing the pairing, since in addition to the normal 
combinations such as (CC')(CC7"), there are also arrangements such as (CC)(CTCT). One 
simplification is to assume that long-wavelength phonons give a zero potential, so that 
V(q = 0) = 0. Neglect the pairing which occurs when q = 0. For a normal metal, there 
would only remain the pairing 6)» 5y—o%p_q4(Pp, T — T) which gives the exchange energy. 
This pairing occurs in the superconductor as well but is not the only term. The pairings which 
include the ¥F functions must pay attention to the spin variables. The combination 
© = —s’ = Î gives 


(Tı Ch as (T)Cyy (1)Cp-q,0(1)C}o (T) (10.4 1) 
= 85. Sy—pigF (P — 4, OF (p, T — 1) (10.42) 
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where the sign change resulted from an odd number of operator rearrangements. Similarly, 
the choice o = —s’ = | gives 


Ch g(t) Cyg(t)Cp_g.o(t)Ch(1')) (10.43) 
= —86_ ySp—_pigF (—P + q, OF '(—p, t — 7) (10.44) 


These two results are identical, since later it is shown that the F and F t functions do not 
depend on the sign of their arguments either momentum or t. The last term in (10.40) gives 
the expression 


U L5 Vq)(T. ch —q,s’ (Tt) Cys (t)Cy_g.o(t)Ch(7)) 


V gp's 


= Ip VZP. T- ma- FP -4.0)F"p.t—7)] (10.45) 
q 


The first term is the exchange self-energy of the electron due to the phonon induced inter- 
action between electrons. It provides the self-energy term È, = —2V(q)n,_,. A careful 
investigation shows that this self-energy does not change much between the normal and 
superconducting states. The self-energy of the electrons, from phonons, causes a change in 
the electron effective mass given by the parameter A. This effect is not large in weak 
superconductors, so it may be ignored. In metals where A is large, such as lead, the super- 
conducting state can be expected to significantly alter the properties of electrons near the 
Fermi surface. Hence the need, in these cases, for strong coupling theory. This self-energy 
term is neglected in our weak coupling theory. 

In the second term of (10.45) there arises the combination of factors which are defined as 


AP) = -- VQ) Fe - 4, = 0) (10.46) 
q 


The quantity A(p) is the gap function in the BCS theory and plays a central role in the 
properties of the superconductor state. The quantity A(p) is defined to be positive, since the 
right-hand side of the definition is positive, with an attractive potential V(q) < 0. Collecting 
these results in (10.40) gives 


(-5 7 3 Gp, T — T) + APF P, t— T) = 8l — 7) (10.47) 


The equation has two unknowns in Y and F t so another equation is needed to link these two 
quantities. It comes from the equation of motion for the ¥ t function: 


= FÝ(p, t- T) = -=10¢« -T Ci Ci E) (10.48) 
- OM = HCL OCh 


= Ar — TCH, OEA AERO] pC) 


= (7, E cho Oii K) | (10.49) 
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It lacks a term 6(t — T’) because the CT operators anticommute. The time development of the 
F¥* operator is determined by 


Cis) = [H, Cho] = E Ci ot. Von Vict p+q, of, p’+q, y Cp',s 


This equation is not the Hermitian conjugate of dC,,,(t)/dt in (10.36), because Colt) is not 
the Hermitian conjugate of C,,(t). The result for dC'/dt brings us to the equation for 
OF! /ar: 


(- ? p) Flt ~7)+- > V(q) (10.50) 
oT V gp’s 
x (TCh g Ch pa Cp EC (7) = 0 (10.51) 


Again there is an expression for four operators. The operator ct pl (t’) is unique, since it is the 
only one which is not operating at time t. Pairing it with each of the other three operators 
gives three terms: 


Cyt): By dy—pA'(p)A(—p, t — 7/) 
CiaO:  8q-0%p F P, T-T) (10.52) 


ch yas T): — 8 yap Spy gpg F Tp, t-r) 


The first term has AŤ (p), which is the important term; the second requires q = 0, which was 
assumed is zero; and the last gives the exchange potential X, = — > V(q)Mpiq> Which is 
ignored again. The exchange potential just renormalizes the chemical potential. This term is 
omitted. These steps brings us to the final equation: 


(- < + g) IP, t=T) +A IP, t- 1) = 0 (10.53) 


The gap function is assumed real, AŻ(p) = A(p), which is verified later. 
The Fourier transforms of the correlation functions are defined in the usual way: 


l —ip,T . 
Gp, T) = B> e Pn G (p, ipn) 
Pn 
dg | —ip, T T . 
F (p, 1) = pre "F (p, ipn) (10.54) 
Pn 
l 
F'(p, 1) = gle PTI, ip) 
Pn 


After transforming, the two equations (10.47) and (10.53) are 


(ip, — p) (P, ipn) + A(P)F (p, ip,) = 1 (10.55) 
(ip, + Ep) F (p, ip,) + A(p)4(p, ip,,) = 0 (10.56) 
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These algebraic equations are easily solved: 


ip, +E 
G(p, ip,) = -——,—~— 10.57 
(Pe Pn) = Wp) (10.57) 
A 
F(p, ip,) = F'(p, ip,) = 5” (10.58) 


p2 +& + A*(p) 


The equivalence of F and F? can be shown by deriving similar equations for F. 

The results (10.58) can be used to test some of the assumptions in the derivation. For 
example, consider the Fourier transform (10.45) of F(p, —t). Changing the sign of the 
dummy variable of summation gives 


1 , 
FP -=z FP, ip) = FPT) (10.59) 
Similarly, 
1 , 1 , 
F (p, T= 0) = p> 7 ip„) = o7 P ip„) = F'(p, T= 0) 


This relation causes the energy gap equation to be real, and for At (p) = A(p) 
1 i 
AP) = -72 VUF P -q,0)= -72 V(q)F '(p — q, 0) 
q q 


= A'(p) (10.60) 


Note that if the energy gap A(p) is set equal to zero, there is found the usual form for the 
normal state Green’s function and zero for the others: 


A=0: IP, tp) = ip — &, (10.61) 


F=F'=0— 


It must still be shown that there is a self-consistent solution to the equations for A Æ 0. 


The Green’s functions Y, F and Ft have poles at the points +E, = H / E + A(p)’. E, 
is called the excitation energy of the superconductor. It will occur repeatedly in the various 


formulas. In BCS theory the gap function A(p) for s-wave superconductors is not dependent 
on momentum p and is treated as a constant for each temperature. Of course, it very much 
depends on temperature and vanishes at the transition temperature of the superconductor. 
However, A is treated as a constant at a fixed temperature. [Later, in strong coupling theory, 
A(i®,,) is found to be a function of energy.] The excitation energy E, depends on E > 0. The 
minimum excitation energy is A, and E, > A. An important feature of the BCS theory is that 
the particles are paired, and it is not possible to excite just one quasiparticle with excitation 
energy £,. Instead, one must break a pair of particles and excite them both to the band of 
excitations. This pair breaking is shown in Fig. 10.4, where the horizontal line represents the 
bound state pairs at the chemical potential of the superconductor. To break a pair, one must 
excite both particles to the excitation line, so that it takes energy E, + Ey > 2A. The mini- 
mum excitation energy of the superconductor is the energy to break a pair, which is E, = 2A, 
where E, is the energy gap of the superconductor. This fact must be kept in mind when 
comparing thermodynamic data with the BCS theory. 
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FIGURE 10.4 BCS model of superconductor. (a) The ground state has electrons paired at the chemical potential. (b) 
Excited states occur by breaking a pair and forming two quasiparticles, each with excitation energy E,,. 


The existence of condensed pairs also clarifies the meaning of the bra (| and ket |) 
symbols in the definition of 


FP, T= 1) = (ITC (C1, E) (10.62) 


At zero temperature, the ket |) is the ground state of the superconductor, which has all bound 
pairs at the chemical potential u. The operators Ch. and Cc | create two excitations with 
energy Ep + £_, = 2Ep. The bra (| state has the same number of particles but two less in the 
ground state. 

The Feynman diagrams for the three Green’s functions are shown in Fig. 10.5. A 
diagrammatic representation is useful when evaluating correlation functions. The way of 
doing the diagrams was suggested by Abrikosov et al. (1963). The Green’s function 9 
represents creating a particle at one point in time and destroying it at a later point in time. It is 
drawn as an arrow with points at both ends and both points are in the same direction. The 
inward point at one end symbolizes particle destruction, and the outward point on the other is 
particle creation. FÌ is represented by an arrow with both points outward, since a particle is 
created at both times. Similarly, the Green’s function ¥ is an arrow with both points inward, 
which represents particle destruction at both points in time. 

It is conventional to introduce the following coherence factors: 


(-2) 
Ep (10.63) 


4 >———_< 


FIGURE 10.5 
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which are misnamed since they have little to do with coherence. Their usefulness stems from 
the fact that they are the residues of the poles of the Green’s functions. Write the Green’s 
functions (10.58) as 


uy v? 
G (p, ip) =., 


+ — 10.64 
ip— Ep ip+Ep ( ) 


, , 1 1 
F (p, ip) = F'(p, ip) = —upr, (Gs — ate) (10.65) 
p p 


This form is useful for doing the contour integrals associated with the summations over 
Matsubara frequencies. The excitations of the superconductor are fermions, and the 
frequencies ip, in Y and F are for fermions ip, = (2n + 1)ni/ßB. The spectral function for the 
Green’s function is 


A(p, £) = —2 Im[Gre(p, €)] = 2m[v25(e — E,) + 25(E + E,)] (10.66) 


These equations are not really solved until A is evaluated. It is found from the definition of the 
gap function in (10.46), where 


1 A A BE. 
= = — = — 2 P e 
F (p, t = 0) = B >. 7 TE 2E, tan ( ) (10.67) 


The summation over frequencies is evaluated in the usual fashion, by the contour integral 
d. A 
bs nel) =a =0 (10.68) 


The integral is zero since the contour is taken to infinity. The poles of np(z) give the 
summation over ip, while the poles at z = +E, give the hyperbolic tangent result. The 
equation for the gap function is 


dq A(p — q) (= - 
A =- |r ———— tanh ~ea) 10.69 
Since A is constant, it can be factored from both sides, which leaves the integral equation 
for A 
N; Vo [° d6 BE 
1a Arbo l dS anh ( PE 
2 Jo E tanh( 5 ) (10.70) 


E= +A (10.71) 


The solution is found in two limiting cases. The first is at zero temperature, where the 
hyperbolic function tanh(BE/2) = 1 


Neho e dE 
2 —@p fe? 4. A? 


/ 2 
= Nr Vo In[é + E? + A?]o” ~ Nr Vo in( “5” (10.73) 


(10.72) 
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which may be solved to produce the equation for the energy gap 


l 
E, = 2A = 40p exp(- A z) (10.74) 


In BCS theory the gap equation (10.70) was solved as a function of temperature. The energy 
gap gets smaller as the temperature is increased, as shown in Fig. 10.6. In BCS theory the 
critical temperature was the result stated earlier in (10.15): 


1 
kpT. = 1.1405 exp( zr) (10.75) 
The ratio of these two results predicts 
E 4.0 
—& = — = 3.52 10.76 
kT, 1.14 > ( ) 


Both the energy gap E, and the transition temperature T, can be measured, so this can be 
tested. It is found to work well in weak superconductors such as aluminum, which is generally 
well described by the BCS theory. For strong coupling superconductors, the ratio increases in 
value. 

The BCS theory is often called a mean field theory, because the gap equation (10.69) has 
that form. It has a similar form to the mean field theory for magnetism in metals (see Kittel, 
1966; he uses the term molecular field approximation). The order parameter, in this case the 
gap A, is set equal to a thermodynamic average over the excitations, which are also functions 
of A. The self-consistent solution determines A. There is a critical temperature T, above 
which no solution is possible. It is seldom that the mean field theory is an accurate description 


o Tin 33.5 Mc 
e Tin 54 Mc 
— BCS theory 


A(7)/4(0) 


VT, 


FIGURE 10.6 The ratio A(7)/A(0) vs. T/T, in tin. The solid curve is the BCS theory. The points are the ultrasonic 
attenuation data of Morse and Bohm (1957) (used with permission). 
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of physical reality, but it certainly works well for BCS. In statistical mechanics, mean field 
theory works well when the forces between particles are long range. In the case of super- 
conductivity, the forces are short range, but the bound state orbits of the particles extend over 
a long distance. 

Besides the existence of an energy gap, another dramatic prediction of the BCS theory 
was that the density of states p(£) for excitations has a square root singularity. It is defined as 


-1 
pE) = (=) a (10.77) 


qo) VEX A? 


This spectral shape is shown in Fig. 10.7, with a gap until A and then a square root singularity. 
The density of states arises from the variable change 


I, a= l ae() = EZO (10.78) 


This density of states should not be regarded as just a function which results from a change of 
variables but as a real property of superconductors which can be measured. 

The above theory is valid for a superconductor with an isotropic energy gap. Electron 
states in crystals belong to representations of the group which leaves the crystal invariant. 
Angular momentum eigenstates, such as s-, p-, etc., do not belong to representations of 
crystal groups. Instead, the crystal representations are mixtures of angular momentum states. 
The energy gap in crystals is found to vary somewhat with direction of the wave vector, 
because of this mixture of angular momentum states. However, the phrase “s-wave super- 
conductivity” is used to describe the state which belongs to the most isotropic representation. 
In a cubic crystal, the gap would have the same value along all +x, +y, +z axes. Similarly, the 
phrase “p-wave superconductivity” applies to crystals in which order parameters might have 
different signs in the +z directions. 

The cuprate superconductors may have “d-wave symmetry”. The conduction electrons 
are confined to move in the (x, y) planes composed of copper and oxygen atoms. In some 
cases the x and y directions have the same lattice constant, and have identical symmetry. Then 
the planes have the same crystal symmetry as a square. In polar coordinates the angle 8 gives 


FIGURE 10.7 BCS density of states of a superconductor. 
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the wave vector direction in the plane (tan 0 = k,/k,). In this case the superconducting order 
parameter for d-wave symmetry is either of the two functions 


An (8) = A, cos(28) (10.79) 
An (0) = A, sin(28) (10.80) 


Similarly, another possible order parameter called “anistropic s-wave” is 
A,(8) = Ay + A, cos(48) (10.81) 


Note that the term cos(48) preserves the symmetry of the crystal: at 0 = 1/2 the function has 
the same value as at 9 = 0. The x- and y-axes are identical for anisotropic s-wave super- 
conductivity. In contrast, the d-wave superconductivity breaks crystal symmetry. 

An important feature of d-wave superconductivity is that the order parameter changes 
sign as the angle @ is varied. There are nodes, where the gap function vanishes. The aniso- 
tropic s-wave can also have nodes if A, > Ag, but that is unlikely. 


10.2. SUPERCONDUCTING TUNNELING 


The most important verifications of the BCS theory came from electron tunneling 
experiments. They measured the energy gap, as a function of temperature, in perfect agree- 
ment with the BCS theory. They also measured the density of states function p(£). Later, 
Josephson (1962) predicted the coherent tunneling of pairs, which was also quickly observed. 
These experiments provided a detailed verification of the BCS theory. 

The theory of electron tunneling between two metal electrodes, through a barrier 
consisting of a metal oxide, was derived in Sec. 8.6. The tunneling Hamiltonian was used to 
obtain the following expression for the tunneling current 


l=],+L, (10.82) 

Is = ie[U,(—eV) — Ua (—eV)] (10.83) 

= —2e Im[U,..(—eV)] (10.84) 
B 

U(iw) = -| dre (T_A(t)A'(0)) (10.85) 
0 


A= > TCC (10.86) 


kpo pe 
The term J, is the single-particle tunneling. The term Z, is for the Josephson effect, and is 


evaluated below. In Sec. 8.6 this expression was evaluated, and reduced to an expression in 
terms of spectral functions and occupation numbers 


Is = 2Y To? [ZAPE Pek, DE -nEeD 010.87 
p 


The subscripts L, R refer to which electrode. The tunneling matrix element T, kp can be 
approximated as a constant for the small voltages used in superconductive tunneling. Then 
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the summations over wave vectors give the interacting density of states N; (£) on the two 
sides of the junction. 


I, = 4ne|T |? | aem +eV)[n-(e) — nre + eV)| 


This expression is also valid in a superconductor with an appropriate interpretation of the 
density of states. 


10.2.1. Normal Superconductor 


Electron tunneling experiments are often done with different metals on the two sides of 
the junction. Since different metals have different transition temperatures (and some are not 
superconducting), it is quite easy to arrange that one metal be normal and the other be a 
superconductor. This experiment is interesting, to the theorist, because it provides a direct 
measurement of the BCS density of states p(E). Take the right side to be superconducting and 
the left side to be normal. The derivation is simple, since the many-body expression (10.87) is 
valid in this case. Just use the respective spectral functions for the normal and super- 
conducting sides of the tunneling junction: 


Ap(k, £) = 2n[uzd(e — E,) + v78(e + E,)] (10.88) 
A; (p, € + eV) = 2nd(e + eV — §,) (10.89) 


Is = 4ne|T |’ 2 (upd(eV + Er — plnr (Er) — nr (p) 
p 


+ 5(eV — E; — &)[np(—E,) — nr (E,)I} (10.90) 


Next use the fact that n-(—E) = 1 — n;(E). Consider the situation at zero temperature, where 
the density of excitations in the superconductor is negligible [n-(E) ~% (eP? + 1)7' ~ e7BE 
~ 0]. The variable change (8.337) brings us to 


©O CO 


1, = 4nelTËNNi| d8y| AEBN — Ey = & IL =ar 


— uZ5(eV + E, — &,)nr(E,)} (10.91) 


The limits of integration have been extended to infinity, which has no effect on the result since 
the limits are narrowly confined. There are two terms in the curly braces: the first is positive, 
and the second is negative. The first will contribute only when eV > 0 (actually eV > A), and 
I; is positive in this case because this direction of electron current is defined to be positive. 
The second term contributes only when eV < —A, and J, is negative in this case, in agree- 
ment with the definition, since Zç has the same sign as V 

At zero temperature, the first term has a factor 1 — np(&p) which limits ¢, > 0. The 
integral dé, extends over positive and negative values of €,. Factors which depend on 


646 Chap. 10 e Superconductivity 


Ey = 4 EZ + A? are the same for both signs of €,. The term linear in €, in the coherence 


factor v? in (10.63) averages to zero because of the cancellation of the +6; and —&, parts: 


[autre =5[ ae (1-2)\ren=5] de se) 


— Ex 2 —0Oo 
- | dé, f(E) = I. dE p(E, (Ep) (10.92) 


where f is any function of E}. The integration is changed to dE,, which produces a factor of 

the density of states p(E) = d&/dE in the superconductor. The integral dé, can be used to 

eliminate the delta function for energy conservation &, = eV — E; > 0. Since &, > 0 then 

eV > E, > A, which gives for eV > 0 
eV eV 

Igy = L @(eV — a) dEp(E) = ŽL @(eV — a| FE 

e A e A VE? — A’ 


= — @(eV — AX (eV)? — A? (10.93) 


The current is a simple square root function of the voltage. This type of dependence is 
observed experimentally. 

Another quantity which can be measured (by lock-in amplifier techniques) is the 
dynamical conductance d//dV as a function of voltage: 


E) -eyo LEN 10.94 
(= oy eVoo corpo at (10.94) 
(5) 

Yw — @ey — Ay — E = peer) (10.95) 


dl, 2 42 
(P) ve =A 


The experimental technique is to compare d//dV for the tunneling between the normal metal 
and the superconductor (d//dV).., with that for the tunneling between both metals when they 
are normal (d//dV),,, = Oo. Low-temperature superconductors can be made normal by the 
application of a small magnetic field, so that the two measurements are done at the same low 
temperature. The ratio of these two experimental quantities provides a direct measurement of 
the BCS density of states p(eV). The current is antisymmetric in voltage, which can be shown 
from the second term in (10.91). 

Figure 10.8 shows the experimental results of Giaever et al. (1962) for dI/dV of a tunnel 
junction between Pb and Mg. The experiments were done at a temperature of 0.3 K with a >He 
refrigerator. At this low temperature, the Mg is normal, and the Pb is well below its transition 
temperature of T, = 7.2 K. The factor n-(E) ~ exp(—BA) is indeed small, and there should 
be few thermal excitations in the superconducting Pb. The figure shows the derivative 
(dI/dV), which well illustrates the BCS density of states. Another feature of their data are 
bumps in d//dV at higher voltages. These are due to phonons and are not predicted by the 
BCS theory. They are explained by the strong coupling theory. These experimental results by 
Giaever et al. (1962) were one of the first experimental indications of the need for strong 
coupling theory. 
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FIGURE 10.8 Electron tunneling between a normal metal (Mg) and a superconductor (Pb). The relative conductance 
of a P-MgO—Mg sandwich plotted against energy. At higher energies there are definite divergences from the BCS 
density of states, as can be seen from the bumps in the experimental curve. Source: Giaever et al. (1962) (used with 
permission). 


The measured current is an antisymmetric function of the voltage 7(—V) = —J(V) for 
the small voltages of interest in this experiment. An energy gap of A = 1.34 meV is observed 
in the tunneling current at zero temperature (which can only be approached asymptotically), 
and no current flows unless |eV| > A. An interpretation of these results is provided in Fig. 
10.9. Figure 10.9(a) shows the two sides of the tunnel junction with no applied voltage 
V = 0. The chemical potentials are the same on the two sides of the junction. On the normal 
side (left) the chemical potential divides the occupied states from the empty states in the 
Fermi sea of electrons. On the superconductor side (right), the chemical potential is the 
energy of the paired electrons. The excitation spectrum is shown as the hyperbolic-shaped line 
above the chemical potential. No net current flows when V = 0. 

The situation for a forward voltage eV > A is shown in Fig. 10.9(b). The electrons from 
the normal side of the junction can tunnel through and become a quasiparticle excitation of 
energy E, on the superconducting side. The arrow, which shows the tunneling path, is drawn 
horizontal to indicate the energy-conserving aspect of tunneling. The electron cannot go into 
the pair state on the right as a single step in the tunneling, since it needs two electrons for this. 
It must become an excited quasiparticle during its tunneling step. Later it will find some other 
excited electron and join with it to become a pair state. This pairing is expected to happen 
well after the tunneling process and so will not affect the tunneling rate. 

The situation for reverse bias is shown in Fig. 10.9(c). At zero temperature, all the 
electrons in the superconductor are in the pair states at the chemical potential, and none are 
thermally excited to the excitation state. The single-particle tunneling process must break up 
the pair state. For each electron which tunnels, its partner in the initial pair state must become 
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FIGURE 10.9 Electron tunneling between a normal metal and a superconductor at T = 0. (a) No applied voltage. (b) 
For eV > A, electrons can tunnel from the normal metal and becomes a quasiparticle in the superconductor. (c) For 
eV < —A, a pair is broken in the superconductor: one electron tunnels, while the other becomes a quasiparticle in the 
superconductor. 


an excited quasiparticle. The electron which tunnels does not do this at constant energy, since 
it must give up some of its energy to its partner to permit it to overcome the excitation gap A. 
Energy conservation can be understood as follows. The initial energy of the paired electrons 
is Einitial = 2p. The final energy is ug + Ez for the one in the superconductor and pt, + &, for 
the one in the normal metal. The total energy of the original pair in the final state is 


Esinal = Hr + Hz + & + Ex = Einitial = 2HR _ (10.96) 


The entire process conserves energy, so this is set equal to the initial energy. The quasiparticle 
energy in the normal metal ¢, must be positive since the electron must go into an unoccupied 
state. The condition for the tunneling current to flow is: 


Ur — Hz =V=5,+k > A (10.97) 


This condition is that the magnitude of the voltage must be larger than A, which is the same 
condition for the other direction of bias voltage. The actual physical processes for the two 
directions of voltage are dissimilar, since in one direction there is direct pair breaking. 


10.2.2. Two Superconductors 


The electron tunneling between two superconductors has two kinds of currents. One is 
the tunneling of a single electron, which is similar to that between normal metals or between 
normal metals and superconductors. The other is the tunneling of pairs, which is the 
Josephson effect. This interesting subject will be described in the next section. Now consider 
only the single-particle tunneling, which can happen between two similar or two dissimilar 
superconductors. Take the case where they are different, so there are gap functions Ap and A; 
for the two sides of the junction. At any point in the calculation the two superconductors can 
be made identical, theoretically, by just setting Ap = A,. 
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The many-body formula (8.344) is still valid when the two sides are superconductors 


CO 


[=2e>> Ta? | Ak &)A(p, € + eV)[n-(€) — np (£ — eV)] (10.98) 
kp n 


—0O 


The spectral function for the superconducting BCS Green’s function is used for both sides of 
the tunneling junction: 

A(k, £) = 2n[u25(e — E,) + vz8(e + Ep] (10.99) 

A(p, € + eV) = 2n[uZ8(e + eV — E,) + v28(e + eV + E,)] (10.100) 


Their product gives four terms, which are regrouped using the identity that n-(—E) = 1 
—np(E): 


I = 4ne 2 Tip? {uzu 5(eV +E; — Epner (Er) — nr (Ep) 


+ uzv,6(eV + E; + E[n; (E;) — nr (—E,)] 

+ vyu,d(e V — E; — E,)[np(—£;,) — np (Ep)] 

+ vy0,5(eV — E, + E,)[np(—E,) — nr (-E,))} (10.101) 
I = 4ne 2 [Tkp {U1 — np(Ep) — np (Eluuz 5(eV — Ep — Ex) 


— uzv,5(eV + E, + Ep)] + [np(E,) — np (E,)] 


x [uguzd(eV + E, — E,) — vpv,6(eV — E, + E,)]} (10.102) 
This formula is the most general result, which is valid at nonzero temperatures. The variable 
change to d&,d&, is done next. These integrals over the coherence factors u? and v? will 
eliminate their linear term in €, as described in (10.92). Then the variables can be changed to 


E=E,=y E? + AZ and F' = E,=¥ E + AŻ, which brings us to the expression 


I = 4reN NalT? | dEpa(E)| dE’ p, (E’) 


; A, 
x {[1 — np(E) — np(E)[S(eV — E — E^ — (eV + E + E')] 
+ [n (E) — np(E)[8(eV + E — EB’) — (eV — E + ED) (10.103) 


Some of the terms are positive, while others are negative. They will ensure that J is positive 
for V > 0 and negative for V < 0. These integrals must be evaluated numerically at nonzero 
temperatures. The physics is illustrated by taking the simplest case, which is zero temperature 
and identical superconductors A; = Ap. At zero temperature, set to zero all the thermal 
factors np(E) and np(E'’): 


=] | am| ae o(E’)[8(eV — E — E') — (eV + E+E) 
e Ja A (10.104) 


Oy = 4ne Ng N; |T|? 
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The two terms in square brackets give identical results, where the first is for eV > 0 and the 
second for eV < 0, in such a way that J(—V) = —I(V). Only the first term needs to be done, 
where E = eV — E and eV > 2A: 


eV—A 
I= (2) orev — 2) dEp,(E)p(eV — E) 


— (=)eer — 2A) | OE EleV —E) 


A VE? — A*,/(eV — EF — A? 


This integral is done in the following way. Change variables of integration to x, where 
2E = eV + x(eV — 2A). After some algebra, the integral is in the form of complete elliptic 
integrals: 


(10.105) 


_eV—2A 


(oN O(eV — 2A) |! | (eV)? — x2 (eV — 2A)’ 
= (2) eV +2A | Yd 220 — 2x2]? 
2 
= (=) @(eV — 2A) PART — (eV + 2A)[K(a) — zol (10.107) 


Equation (10.107) is the final result, which is compared with the experiments. One check on 
the correctness of the answer is to set the energy gap A = 0. The coefficient of K(«) vanishes, 
so the part in curly braces becomes eVE(1) = eV. The current is J = OV, which is the 
correct result, since in the absence of superconductivity the tunneling device is a resistor at 
low voltage with the conductance oo. 

The tunneling current is zero for voltages less than twice the gap function 2A. It is 
interesting to examine the threshold phenomenon, for voltages slightly larger than 2A. Since 
K(0) = E(0) = 1/2, at eV = 2A” the current is 


IQA*) = (=) TA oy? (10.108) 


It has a nonzero value, so there is a discontinuous jump in the tunneling current at eV = 2A. 
The current above threshold is 1/4 of the value of a normal tunneling current at the 
same voltage. Exactly this behavior was found by Giaever et al. (1962). Their data for a 
Sn-SnO,—Sn junction is shown in Fig. 10.10. The entire experimental curve fits the theo- 
retical curve from (10.107), except for the finite slope on the “discontinuity.” They also point 
out an interesting feature of this discontinuous jump in current: the discontinuity does not get 
thermally broadened with increasing temperature. As the temperature is increased, the energy 
gaps get smaller, so the value 2A of the jump moves to small voltages. However, the BCS 
energy gap does not get smeared at higher temperature; it only gets smaller. At higher 
temperatures, the nonzero value of np(E) will lead to some tunneling current for eV < 2A, 
but there is still a discontinuous jump at this critical voltage. 

The physical process of the single-particle tunneling is shown in Fig. 10.11. For electron 
flow to the right, a pair on the left is broken up, and one electron tunnels, while the other 
becomes a quasiparticle on the left. The initial energy of the pair is E; = 2u;, and the final 
energy of the pair is Ey = (u; + Ez) + (Ug + Ep). By equating these two energies, we derive 
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FIGURE 10.10 Electron tunneling between two identical superconductors. Current voltage characteristic for a 
sandwich compared with the BCS theory. The experimental curve has a finite slope at 2A because of the anisotropic 
nature of the energy gap. Source: Giaever et al. (1962) (used with permission). 


Hr — ug = eV = E; + Ep. The summation of the two excitation energies must be equal to the 
applied voltage, which is in agreement with the theoretical formulas. 

The reader should now be convinced that the BCS theory rather well describes the theory 
of electron tunneling in low temperature superconductors. Indeed, the BCS theory describes 
rather well all the properties of weakly coupled superconductors, and other good agreements 
between theory and experiment will be shown later. Several severe approximations were made 
in the derivation of the BCS theory. Apparently none of these really matter. Two of the major 
approximations were (1) calling the gap function A(p) a constant A and (2) replacing the 
retarded (i.e., frequency—dependent) interaction between electrons by a static potential, which 
is a constant. A constant potential in Fourier space is a delta function in real space, so it was 
assumed that the interaction between electrons is localized in space. The electrons themselves 


FIGURE 10.11 Physical process of electron tunneling between two superconductors. A pair breaks on the left, one 
electron tunnels, and both electrons become excited quasiparticles. 
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are not localized. Indeed, the pairing orbits are extended in space, but the particles interact 
only when they are in contact. In any case, the essential feature of BCS theory is the existence 
of the additional correlation functions F and Ft. This feature must be right, and the 
particular method of getting to these correlation functions is not very crucial. 

Electron tunneling is a less successful experimental tool for the investigation of high 
temperature superconductivity. Although many experiments have been done, the results are 
inconsistent and difficult to interpret. There are several problems. The cuprates are layered 
materials. The electron conduction is within a layer, and the layers are weakly coupled. That 
means the top few layers of a material may not even be superconducting. Tunneling down to 
the top layer may be going into a normal layer, even if the bulk is superconducting. The other 
geometry is to tunnel into the layers from the edge. In this case, the electron is injected into a 
single layer, which is a two-dimensional electron gas. The last chapter discussed MND 
effects, which is the renormalization of a transition intensity due to the process of injecting an 
electron into an electron gas. Such effects are accentuated in two dimensions. So the spectral 
function measured in tunneling may be severely distorted due to electron—electron interac- 
tions (Shekhtman and Glazman, 1995). Finally, if the gap has d-wave symmetry there are 
nodes where the gap vanishes. Depending upon the directional dependence of the tunneling, 
there may be no energy gap to observe. 


10.2.3. Josephson Tunneling 


The derivation of the tunneling current led to two terms, which are J; and J;. They are 
the single-particle term and the Josephson term. The single-particle tunneling has been 
discussed in great detail, and now examine the Josephson effect. The discussion of this 
current contribution closely follows Ambegaokar and Baratoff (1962, 1963). The Josephson 
current was derived in Sec. 8.6 


L(t) =e | i dt @(t — tte" } (40), ALN — eh’ ate, Ae (10.109) 


—&O 


The single most interesting feature of this equation is that it contains, in the two exponents, 
the sum of the two times t + £t. This dependence has not been previously encountered in any 
of the correlation functions, since they usually come out containing t — t’. The factor of t + r 
is the first indication, mathematically, that the Josephson effect is rather special. The problem 
oft + ť is solved by writing it as 2¢ + ¢/ — t and pulling the factor 2t outside of the correlation 
function. The current is written as 


L(t) = elemani [- dt O(t — te" AO, AC) + he 


Define the retarded correlation function 


Dalt — t) = iO, — tA, A@))) (10.110) 
®.(eV) = D dt VOD (t — t) (10.111) 


IÀ = ~ie[ e” AD len) — e O, (eV) (10.112) 
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The part in brackets is purely imaginary since it is a quantity minus its complex conjugate. 
The Josephson current is 


I,(t) = 2e Im[e~2"""/* @_(eV)] (10.113) 


It is important to include the factor exp(—2iteV /h) in the expression whose imaginary part is 
evaluated. The Josephson tunneling is not just the imaginary part of a retarded function—1t is 
not just a spectral function. Instead, one must multiply by the phase factor exp(—2iteV /h) 
before taking the imaginary part. The phase factor contains most of the interesting physics. 
For a nonzero voltage, the current is actually oscillatory in time, with a frequency œ = 2eV /h. 
This prediction of the theory is so absolute, and simple, that the frequency can be used as a 
method of measuring the ratio of fundamental constants e/h (see Parker et al., 1969). 

The first task is to develop a method of calculating the retarded correlation function 
®..(eV). As usual, start from a Matsubara function, 


B 
(io) = -| dte'"(T_A(t)A(0)) 


B o. 
=- © Trglup | de® T ACC Cpe! (10.114) 
po k’p’o’ 


and later make the analytical continuation iœ — eV + id. The spin index has been explicitly 
added to this expression. The correlation function can be factored into a term for the left and 
right sides of the tunnel junction. In this case the factoring produces the Green’s function F 
and Ft: 


B 
Plia) = 2 > Trp T x, | dre" (T, Ch) (O) T, Cp OCO) 
p 


Bo. 
=2% ToT] dte F’ (k, t)F(p, —1) (10.115) 
kp 0 
1 
=2 F NpT_k—p 5 Y F` (k, ip) F (p, ip — io) (10.116) 
kp ip 


The factor of 2 is for spin arrangement. One spin configuration is shown explicitly, and 
the other one gives the same result. The Feynman diagram for this tunneling is shown in Fig. 
10.12. There is an FT on the right side of the junction and F on the left, where the arrow 
conventions agree with those set forth in Fig. 10.5. The direction of the arrows is continuous 
through the symbol T in a circle, which must always happen with any diagram where particle 
number is conserved. 
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FIGURE 10.12 Feynman diagram for Josephson tunneling. 


The summation over Matsubara frequencies in (10.116) is done by the usual method of 
contour integral: 


5 F* (k, ip)F (p, ip — ©) (10.117) 
ip 


= 1 — n (Ep) — nE | ———— - —_—— 
ype, |l Ep) l oe E, oE, E 
kp P P 


1 
+ [np(E,) — nr(Er)] (eine g TEE) | 


Dipl _k,—p 


lio) = A, Ar Y (10.118) 


kp 2ErEp 


1 l 
x lu — np (Es) — nr (E,)] (w= 7 aE) 
1 l 
+ [np(E,) — np (E; )] (st 7 ate) 


This expression is rather complicated, particularly for nonzero temperatures and different 
superconductors. The physics may be best understood by examining the simplest situation, 
which is zero temperature [n;(£) = 0] and identical superconductors (A; = Ag). With the 
analytical continuation iœ — eV + id, consider the quantity 


Tp Tk — l l 
© V — A? kp“ —k,—p Eee 10.119 
wl) =A OEE, (V +E, +E Ti eV —E, — E, + ið (10.119) 


An important feature of these equations is that the tunneling matrix elements 7, kp are not 
necessarily real, so the combination TkpT-ķ,—p is not real. Write this factor as an amplitude 
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IT|? and a phase factor exp(i®) and consider both of these to be independent of k and p. The 
summations over k and p can be changed to integrations over d€,d¢, and then to dE,dE,: 


(eV) = Z Js(eV)e" (10.120) 


p(E) p(E’) l l 
Is(eV deA°ITPNNel a pO | dE’ —____ — —_____ . 
s(eV) = 4eA*|T\"NNp , E Noy AREAL a gp p) (0120 


=a(-2 >)x (=) | (10.122) 


The integral is an elliptic integral. The retarded correlation function is written as a phase 
factor times a function of voltage J>(eV), where Js is real for |eV| < 2A. The Josephson 
current can be written as the imaginary part of (10.113), or 


B®) = Js(eV) sin(ot + ®) (10.123) 
o = zer (10.124) 


The Josephson current oscillates with time, with a frequency œ = 2eV /h. The amplitude of 
the oscillation is determined by the function of voltage J,;(eV), where it is assumed 
|eV| < 2A. The easiest way to evaluate J;(eV) at nonzero temperatures is to use a different 
sequence of steps. Ambegaokar and Baratoff showed that it is best to return to (10.116) and to 
do the two integrals d&,d&, before the summation over Matsubara frequency ip,. This step 
uses the result 


| dt, F (k, ip) = af A -A (10.125) 
—0o -œ G& +A +p? [A2 + p? 
The function J,(eV) is the retarded part (iœ —> eV) of the function J,(iw): 
Js(i) = 4e¥ Tig a F" (k, ip) F (p, ip — iw) (10.126) 
ip 
ApA 
= 4n? eNANITP 5 a (10.127) 


Py VAR + pry At + Pn — 


The summation over Matsubara frequencies ip, should be done before the step of analytical 
continuation iœ — eV. An error is made if these two steps are reversed, except at the point 
eV = 0. At zero voltage, the result for identical superconductors is [see (10.67)] 


1 
Jev =0 den2N2A2| 7/2 } ———— (10.128) 
° = B 2 A? + p? 
= 2en*N*A|T|? tanh (5) (1.129) 


At zero temperature there is the particularly simple result that 


J,(0) = (= : 2o) (>) (10.130) 
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The coefficient of the Josephson current is the same current found in a normal junction at a 
voltage of eV = nA/2. 

Josephson tunneling has a wide variety of behavior in junctions in static electric and 
magnetic fields. A very simple characteristic current is obtained in the case where there is 
neither a voltage V = 0 nor a magnetic field. Then one gets that the Josephson current is 


I, = Jg(0) sin(®) (10.131) 


where ® is the phase difference between the two superconductors on the two sides of the 
tunnel junction. 

The experimental arrangement is to put a current J through the tunnel junction, so that 
current flows through the interface oxide. The voltage change V is measured as a function of 
the applied current Z. It is found that a dc current can flow without any voltage V This 
behavior is sketched in Fig. 10.13. The de current is the coherent tunneling of electron pairs 
from one superconductor to another. The pair tunneling costs no energy when the two 
superconductors have the same chemical potential. When the experimentalist increases the 
current, the phase difference ® between the two superconductors adjusts to make the 
Josephson current J;(0)sin® just the experimental value J of current. The phase increase 
continues for increasing J until the point is reached where J = J,(0), or ® = 1/2. Then the 
current jumps from the zero-voltage line to the right and on to the experimental curve for 
single-particle tunneling, which is shown as the dashed line. In fact, in most tunnel junctions, 
the maximum phase ® = 1/2 cannot be reached, and the jump to the single-particle tunneling 
curve occurs at applied currents smaller than the maximum J,(0). 

Another experiment is to take a tunnel junction between two superconductors and to 
apply a static voltage eV < 2A. Then one observes an oscillatory current according to 
(10.124). This oscillatory current is the tunneling of pairs from the chemical potential of one 
superconductor to the chemical potential of the other. This tunneling of pairs does not 
conserve energy but takes an energy 2eV for the pair of charge 2e to overcome the voltage V 
The pair must tunnel back, since it cannot complete a transition where energy is not 
conserved. This ac current can be observed by the external radiation it produces (Langenburg 
et al., 1965). 

The preceding two experiments must be performed in zero static magnetic field. The 
Josephson effect is so sensitive to magnetic field that care must be used to cancel out the 
effects of the earth’s magnetic field. This sensitivity to magnetic field, which was pointed out 
by Josephson (1962), is useful for verifying that the tunneling is due to the Josephson effect 


FIGURE 10.13 Current—voltage characteristics of a Josephson junction. The dashed line rising abruptly at eV = 2A 
is the single-particle tunneling curve in Fig. 10.10 at T = 0. The arrows on the vertical axis show that a current can 
flow with no voltage. But an imposition of too much current causes the device to jump to the single-particle curve. 
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GA 
FIGURE 10.14 Coordinate system used in discussing Josephson tunneling. 


and not some other process such as a superconducting short in the oxide—the latter would be 
expected to behave like the current characteristic in Fig. 10.13. The method of including 
magnetic field is as follows. The total phase difference between the superconductors is written 
as ®. The phase is now meant to include the applied voltage K so that we write it as 


I, = Js(eV) sin(®) (10.132) 

2ef' @ av 
®=0,+—] df — 10.133 
° + h l, K dx l ) 


The spatial coordinates are shown in Fig. 10.14. The integral f dx in (10.133) is taken across 
the junction from one side to the other. The dx integral is actually V(b) — V (a) = V, since a 
and b are on two different sides of the oxide and each is deep within the bulk of the 
superconductor. 

The phase ® needs to be expressed in a manner which is gauge invariant. Write the 
phase as an electric field E(r, t) which is always gauge invariant (see Sec. 1.5): 


dV 134, do 
oy et 10.134 
dx > Es c ât + əx ( ) 
2e 0A, 2e do 
= Ë — / / x ~~ / 1 Y 1 1 
| ax fa ay + 7 | avar z (10.135) 


The vector potential A, term is manipulated in the following fashion. It is split into time- 
dependent and time-independent terms. The time-independent term is obtained by doing the 
time integral 


' A 
| dt’ yr = A,(t) = A, (10.136) 


Second, the space integral f dx’A, is recognized as a line integral | dl- A along a particular 
path, and any path will give the same result. Then the use of Stokes’ theorem brings us to 


[a-a=|as-vxa=|as-B, (10.137) 
2 2e f 
® = Do +5 | ds-By -F| at | at Er’, t) (10.138) 


The identity V x A = Bọ gives the term f ds- Bo, which is the magnetic flux under the area 
of the integral. The external magnetic field Bọ = Boy is constant when A is constant. The area 
ds is the distance z from some reference point, which can be the end of the junction, times the 
effective distance in the x direction. The latter is the thickness of the oxide d plus the distance 
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the magnetic field penetrates into the superconductor on each side of the junction. The latter 
distances are the two penetration depths Ap, + àz. In a fixed magnetic field the phase is 


p = Q, + kz + ot (10.139) 
2 
k == Bod + Ar +M) (10.140) 
The magnetic field generates an effective wave vector k. The Josephson current J; now 


depends on the distance z from the reference point. The total current in the sample of length L 
in the z direction is found by integrating in this direction (eV = 0): 


J) = Js(0) sin(®y + kz) (10.141) 
1 (°/? , 2 . (kL 
I, = T [aie = J;(0) sin()| = sin (=) (10.142) 


The theoretical prediction is that the maximum current at zero voltage has a Fraunhofer 
pattern sin(®)/@, where 8 œ By. The Fraunhofer pattern was indeed observed by Fiske, as 
shown in Fig. 10.15 for Sn—oxide—Sn junctions. This experiment is another spectacular 
verification of Josephson’s predictions. Even more interesting results are obtained when there 
is both a static dc magnetic field Bọ and dc voltage V Then the Josephson current has the form 


i@ =Jg(eV) sin(®, + kz + ot) (10.143) 


This form of current is also observed in a variety of experiments which are described in the 
review volume edited by Parks (1969). 
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FIGURE 10.15 The Fraunhofer pattern of a Josephson junction. The experimental curve is the maximum dc current 
which can flow with no voltage as a function of applied magnetic field. Source: Fiske (1978, unpublished) (used with 
permission). 
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Another way to evaluate the integral in (10.142) is to separate the negative values of z, 


and then to change z’ = —z in this integrand 
JF ga MP os 
I, = TT dz sin(®, + kz) + | dz sin(Wy + kz) (10.144) 
—L/2 0 
J (0) T [2/2 L/2 
= 7s) | dz sin(D, — kz) + | dz sin(D,) + ke) (10.145) 
0 0 


where the first integral is for the left side of the junction, and the second integral is for the 
right side. 

A dc Josephson current at zero magnetic field is observed between lead and the high 
temperature cuprates. This observation is evidence that the cuprates are also spin singlet 
states. A Josephson current is not allowed between two different superconductors if one has 
the spins in a singlet and the other has the spins in a triplet. This experiment proves that the 
cuprates are not p-wave superconductors, since symmetry demands that the p-state must have 
a spin triplet. See Chapter 11 for a discussion of p-wave superconductivity. 

The same Fraunhofer pattern is observed for flat junctions, if one side has lead and the 
other side has a cuprate superconductor. Wollman et al. (1995) made a tunnel junction that 
wrapped around a ninety degree corner. The junction now has the geometry shown in Fig. 
10.16(a). The inside corner is a cuprate superconductor, while the outside is lead. It is known 
that lead is an s-wave superconductor. If the cuprates are also s-wave, one should get the same 
Fraunhofer pattern as for a flat junction. However, if the cuprates have d-waves symmetry, the 
tunneling currents in the two arms of the corner should be out of phase. This case can be 


FIGURE 10.16 (a) Geometry of a corner junction between s and d-wave superconductors. (b) Critical current vs. 
applied magnetic field for a corner junction. The predicted current is zero at zero magnetic field since the currents are 
in opposite directions in the two arms. 


660 Chap. 10 e Superconductivity 


evaluated using (10.145), where the two integrals represent the two arms of the corner. 
Changing the relative sign between the two integrals gives 


Mr )sa = Js(0) cos(@o) (=) (10.146) 
0 = z (10.147) 


The pattern of maximum current is no longer Fraunhofer. The predicted behavior of a corner 
junction is shown in Fig. 10.16(b). Precisely this behavior was found for corner junctions by 
Wollman et al. (1995). The Josephson current vanishes at zero magnetic field because the 
current is in a different direction in each arm. The two arms have to have the same length for 
perfect cancellation. Their experimental result is strong evidence for d-wave super- 
conductivity in the cuprates. 


10.2.4. Infrared Absorption 


A unique feature of the BCS theory was the prediction of an energy gap in the excitation 
spectrum of an s-wave superconductor at zero temperature. Electron tunneling (Giaever, 
1960) was the first measurement to convincingly demonstrate the existence of the energy gap. 
However, this experiment was not commonplace at the time of the BCS theory (1957), and 
the conventional method of seeking an energy gap would be to measure the optical absorption 
of the metal in the far infrared. For example, Pb has T, = 7.2K so kgT, = 0.62 meV. The 
BCS prediction is that the energy gap is E, = 2A = 3.52(kgT,) = 2.2 meV = 17.7 cm™!. The 
value found experimentally is E, = 22.5 cm™!. The difference from the BCS prediction is due 
to the fact that Pb is a strongly coupled superconductor. The frequency range is very difficult 
experimentally so that good experimental results for Pb were not reported until 1968 (Palmer 
and Tinkham, 1968). 

An s-wave superconductor at zero temperature should not absorb radiation for œ < 2A. 
The absorption breaks up an electron pair, which creates two quasiparticles. Each quasi- 


particle has an energy of the form E = JE + A*, so absorption starts at hœ = 2A. The 
theoretical discussion follows the original theory by Mattis and Bardeen (1958). 

The rate of absorption in the infrared is given by the real part of the conductivity, which 
is evaluated from the Kubo formula: 


1p. 
x(q, ia) = =z; | de® (Taila, 2) (a, 0) (10.148) 
o(a, io) = —Imru(4, ©) (10.149) 


. e 
ID = — DP +da Cpo (10.150) 
po 


Sec. 10.2 e Superconducting Tunneling 661 


The expression for the current is appropriate for noninteracting particles. There results a 
correlation function of four operators 


. e? 1 / l 
n(q, ia) = — TMV prc +5q)°(p' — 54) 
P } t i 
x | dre!" (T, C} gg o(1)Cpol®)C} a 0 (0)Cyo (0) (10.151) 


In prior discussions of optical absorption processes, it was convenient to set to zero the 
photon wave vectors q. The step q = 0 is not done here because later it would be found that 
the conductivity is zero. It is formally necessary to retain a nonzero value for q, although the 
magnitude of q © 10? cm™ can certainly be neglected compared to the wave vectors of the 
electrons kp œ% 108 cm™!. 

To evaluate the bracket of operators in (10.151), fix one spin, say o = +. The answer is 
twice this result, since the same contribution is found for o = |. The operators in the angle 
brackets can be paired two ways (the pairings with q = 0 are not allowed since q is nonzero): 


(TCi yg (OC (DCH _g g(0)Cyo(0)) = 85-4 Sy prg DIP- T) (10.152) 
+ v=, 5p—-pF (P, DF '(pt+q,t) (10.153) 


These pairings evaluate the correlation function, which is written in both t-space and Fourier 
space: 


(q, io) = 2e ptg [ ace 
ii 7 3m2v p P 24 0 
x [G(p, Z + q, —t) + F (p, —1)F (p + q, 1)] (10.154) 
, 2e? L2 l 
n(q, ia) = Inv P + 5q) B> 


x [G(p, ip)A(p + q, ip + ©) + F(p, ip) F'(p + q, ip + io)] (10.155) 


Note that the vector vertex of the current operator changes the relative minus sign between the 
two terms in (10.152) to a relative plus in (10.155). The two terms in (10.155) can be 
represented by the two diagrams shown in Fig. 10.17. The photon of (q, œ) enters and leaves, 
and couples to the two solid lines of the polarization diagram. The solid lines can be either the 
pair GG or FF' as these are the only two possibilities which conserve particle number at the 
two vertices. One could not, for example, have the combination GF since that does not 
conserve particle number at one of the vertices. The crucial aspect of the above expression is 
the relative sign between the two terms GY and FF". There seems to be no easy way to 
deduce this sign, except by being careful. 


FIGURE 10.17 
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The next step in the derivation is the summation over Matsubara frequencies. The 
summation for F Ft was given previously in (10.117), while the other one is 


1 
= FQ. ip\AK, ip + io) 
ip 


= [1 — np(E,) — nE #2 upt% 
MEME) EEN oE, — E, io + E, +E, 


uzu? vu 
+ [n (E) — n (EQ ——— -4 (10.156) 
Bee E iœ +E,- E, io- E, +E; 


At nonzero temperatures, there are a number np (E) of quasiparticle excitations with energy £ 
which can absorb the radiation even for hœ < 2A. The interesting case is the limit of zero 
temperature, where there is no absorption until A@ > 2A. At zero temperature the nonzero 
terms are 


2e? 
n(q, io) = => (p+ 3.4) 
P 
2 42 2,2 
< | p+ap — “pYp“p+aľpra _ MpYp+a — Up Yp4n+q’ptaq (10.157) 


The retarded function is obtained from iœ — œ + ið, and taking the imaginary part of the 
retarded function brings us to the real part of the conductivity as v > oo for œ > 0: 


2ne? l dp 


— 2 2 
o(q, ©) = —— On’ (p +q) [už ath — UpUpYpsrq¥prql(@ — Ep — Ep+a) (10.158) 


3m? O 


This expression has a reasonable form. The photon is absorbed by having its energy h@ 
converted into two quasiparticles with separate energies E,, and Ep+q: 

The combination of coherence factors uf, 4¥5 — UpUpUp+q¥p+qdetermines the rate of 
absorption. This quantity vanishes as q — 0. It was remarked that q « p, so that one should 
indeed take this limit. Then one would conclude that the absorbing power of the super- 
conductor was very weak, at least for the process of creating two quasiparticles. This 
conclusion is valid in the case where momentum is a good quantum number, so the two 
quasiparticles made by the photon have momenta which are related, i.e., p and p + q. 

Mattis and Bardeen assumed that momentum was not conserved. Instead, the two excited 
quasiparticles have momentum p and p’ which are unrelated. The lack of momentum 
conservation is characteristic of the so-called dirty superconductor. The momentum breaking 
might occur, for example, because the experiments are done on evaporated metal films which 
are not crystalline. Alternately, there might be phonons or impurities which scatter the 
particles. In any case, it is assumed that energy is conserved but not momentum. Separately 
sum over p and p’ which leads to the expression 


CO 


Cy [~ ; 
o(@) = a| a, | d5,0(@ — E, — E,y)[vuyy — UyVyUy Vy | 


where Cy is a constant which will be determined later. The terms in the coherence factors v? 


and us, which are linear in €, and éy, vanish because they are odd functions of 6, and &y. 
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Then change variables to E = E, and E’ = E,,, and derive the expression 


C, f° oe) A2 
olo) = =| aEp(E)| dE’ p(E’)6(@ — E — D: — ) (10.159) 
© JA A EE’ 
The F’ integral is done by energy conservation: 
Cy wo—A A2 


This integral has the same form as (10.105) and is evaluated in the same manner: change 
variables of integration to x, where 2E = œ + x(@ — 2A), and the integral has the form for 
complete elliptic integrals: 


@—2A 
„„2=24 (10.160) 
O 7 _ ee ee 
olo) = zo O(a 2A)\(@ 2A)| dx (a — x2)(1 — 2x2) (10.161) 


o(@) = 0 Olo — 2A)[w + 2A)E(«) — 4AK(a)] 


The absorption for a normal metal has A = 0, which in the same units is Cy. The ratio of the 
conductivity in the superconductor to that of the normal metals is 
0,(0) 1 
S.(@) > Om 2A)|(@ + 2A)E(«) — 4AK(a)] (10.162) 
which is the result of Mattis and Bardeen. 

The experiment measures the conductivity in the normal metal and the superconductor at 
the same temperature. The metal is made normal, at the low temperature, by the application of 
a small magnetic field. The experimental results of Palmer and Tinkham (1968) for Pb are 
shown as the points in Fig. 10.18. They compare well with the theory of Mattis and Bardeen, 
Eq. (10.162), which is shown as the solid line. There is no absorption until the energy gap of 
22.5cm7!. Then the absorption rises gradually with increasing œ. It does not, as in the case of 
tunneling between two superconductors, rise discontinuously at the threshold frequency. 

The agreement between theory and experiment is obviously excellent. In fact it is 
initially surprising that the agreement is that good. After all, Pb is a strongly coupled 
superconductor, and there is no particular reason to expect the BCS theory to work well for 
this metal. This question was investigated by Shaw and Swihart (1968), who solved the theory 
of infrared absorption using the strong coupling theory. They found that the results were 
nearly identical to the predictions of the BCS theory. This experiment is not sensitive to the 
strength of the coupling. There are, however, numerous other experiments which can 
distinguish between these theories. 

Many far-infrared experiments have been performed on high-temperature super- 
conductors. The results have generally been inconclusive. There is no peak which can be 
identified as due to the energy gap of the superconductors. The cuprates are oxides, and have 
many optical phonons in the same range of frequency, which complicates the interpretation. 
Of course, if the order parameter has d-wave character, then there is no energy gap in the 
infrared. The node in the gap function means that there is a distribution of energy gaps, which 
extends down to zero energy. 
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FIGURE 10.18 Far-infrared absorption in superconducting lead at 2 K. The solid line is BCS theory, and points are 
experimental. Source: Palmer and Tinkham (1968) (used with permission). 


10.3. STRONG COUPLING THEORY 


The BCS theory is based on a simple approximation (10.2) to the attractive interaction 
between two electrons. This theory successfully describes the properties of many elemental 
superconductors. Nevertheless, there are some superconductors for which the electron- 
phonon coupling is quite strong, and here the BCS theory is not an accurate description of the 
superconducting state. Instead, one must solve the gap equation for a realistic interaction 
between two electrons, which includes both the attraction due to phonons and the repulsion 
due to the screened Coulomb interaction. It is important that the phonon term be fully 
retarded, 1.e., that the interaction be frequency dependent. This strong coupling theory was 
described by Eliashberg (1960). One of its important features is that the gap function A(œ) 
becomes a complex function of the real frequency œ. Alternately, it is a real function A(ip,,) of 
the imaginary frequency p,. In addition, the self-energy function of the electrons, due to 
electron-phonon interaction, must be retained and plays an important role in the analysis. The 
discussion of strong coupling theory follows Scalapino et al. (1966). 

The starting point in the theory is the screened interaction between two electrons, which 
is written in the conventional way as the sum of a screened Coulomb interaction and a 
screened phonon interaction (see Secs. 7.3 and 7.4): 


Vee(q, im) = V.(q) + Mi DIAG. iw) (10.163) 
Ane? 

— 10.164 

V.(q) Zela ( ) 

ma = (10.165) 


e(q) 
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The Coulomb term V, is taken to be instantaneous, since the frequency dependence of the 
dielectric function e(q, œ) is not important for the low frequencies of interest, œ < œp. 
Similarly, the screened electron-phonon coupling constant M,(q) is also taken to be a 
function of wave vector but not frequency. These were the same approximations used in Sec. 
7.4 to discuss the electron—phonon effects in normal metals. 

The objective is to calculate, self-consistently, the properties of the correlation functions 
G, F, and FY. The last two are again equal, so it is sufficient to find one of them. They have 
the same definition (10.31) as before. Now they will be evaluated by directly expanding the S 
matrix and summing the terms which are found in higher order: 


F(p, t— v) = —(T;Cp o(t)Ch(v’)) = (Tı C wo (T)SCH(1’)) 
F(p,t— T’) = (T,C_py()Cpy(1’)) = (T: p (DSCpy (7) 
F'(p, t= T) = (T: (DC a) = (TES e) 


Two different self-energy functions are retained in this expansion. Both are one-phonon self- 
energies, where the electron line in the self-energy can be either Y or F 


S(p, ip) = — (<4 om sgh Vala iqy9(p + a ip + iq) (10.166) 
W(p, ip) = — ES 2 Vq, iq) F (p + q, ip + iq) (10.167) 


These self-energies are shown as Feynman diagrams in Fig. 10.19. The dashed line is Ve in 
(10.163). The electron line, in the self-energy expression, contains the Green’s functions Y or 
F which are to be determined self-consistently. This theory neglects the vertex corrections, 
which are the self-energy diagrams of the type shown in Fig. 10.19(c). This neglect is 
customarily justified on the basis of a theorem due to Migdal (1958), who argued that such 
terms were smaller by a factor of ./m/M, where m is the electron mass and M is the ion mass. 
The difficulty with his argument is that superconductivity itself (i.e., the Cooper instability) is 
caused by a vertex diagram. A theorem which asserts that all vertex diagrams may be 
neglected is obviously unreliable. The strong coupling theory is described in its customary 
form, which does neglect these vertex corrections. 

First discuss the wave vector dependence of W(p, ip,,). The wave vectors of interest are 
on the Fermi surface. Since the Fermi surface is not a sphere, the magnitude of the Fermi 
wave vector will depend upon the direction in the crystal. For an isotropic s-wave super- 
conductor, then just ignore the variation in the Fermi wave vector. This approximation is 
tantamount to neglecting the wave vector dependence, and writing the gap as W(ip,,). There 
are many cases where this approximation is poor, and one should include the dependence of 
the gap upon direction in the crystal. Because of crystal symmetry, this dependence can 
usually be accomplished with a few functions, which are the basis functions of the group 


(a) S(p,ép,) (b) W(p,ép,) (c) 
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which describes the crystal symmetry. For cubic crystals these are called Kubic Harmonics 
(van der Lage and Bethe, 1947). For other crystals, similar functions can be constructed. For 
example, in high-temperature superconductors, recent experiments have indicated that the gap 
may have d-wave symmetry. The symmetry is x? — y?. Define the angle p for the two- 
dimensional square lattice by tan() = &,/k,. Then a good approximation to the gap function 
is W(p, ip, ) = W,(ip,) cos(2). The angular function cos(2@) gives the proper symmetry of 
being plus one along the +x axes and minus one along the +y axes. This ansatz form could be 
used in the gap equation. In this case it is relatively easy to integrate out the angle © so that 
one is left again with solving a matrix equation where the only important variable is the 
imaginary frequency. 

Normally the expansion of the S matrix leads to a Dyson equation of the typical form: 


Gp) = GOP + EPP) (10.168) 


The equivalent expansion is derived for the superconductor. The situation is obviously more 
complicated, since there are two self-energy functions S(p) and W(p), and two Green’s 
functions Y and F. The noninteracting function G(p) = 1/(ip — Ép) as usual, but obviously 
there is not an equivalent form for F © so F = 0. The relevant Dyson’s equations are, in a 
four-vector notation p = (p, ip) 


Gp) = GOPI + SPFP) -WPF p) 
F (p) = GV (PIW PZP) + S(-P)F"@)] 


These equations are shown diagrammatically in Fig. 10.20. The key to constructing the 
equations is to remember that the vertex conserves particle number, so that the two electron 
arrows must point in the same direction; i.e., one can only have vertices of the form below. 
These equations are simplified because W(—p) = W (p). Dyson’s equations are quite similar 


(10.169) 


| l 

| | 

l l 
——_»—|_ »—— —~q——l__.q¢@—— 


to the BCS equations (10.56). This resemblance is made more apparent by rewriting (10.169) 
using Y°(+p) = 1/(+ip — &,) to obtain 

lip — &, — SPNZ + WPF’ =1 (10.170) 

lip + &, + S(—p)|F' + WP) =0 (10.171) 

The BCS equations (10.56) are obtained in the approximation where W(p, ip) = 


A = constant, and the other self-energy S(p,ip) is ignored. The more general equations 
(10.171) are now solved. The gap function W(p) is an even function of ip, as already stated 
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above. The other self-energy S(p) is not. It is convenient to break it up into its symmetric and 
antisymmetric parts: 


S(p) = S.(p, ip) + ipS,(p, ip) (10.172) 


where S, and S, are both even functions of frequency ip. Furthermore, the function Z(p, ip) is 
defined as 


Z(p, ip) = 1 — S,(p, ip) (10.173) 


The notation Z usually is applied to a renormalization coefficient for the single-particle 
Green’s function. Here the definition is different, but the physics is similar, since Z again turns 
out to be another type of renormalization function. Now it is easy to solve the coupled 
equations (10.171) by writing 


ip —&, — S(p) = ipZ(p) -&,, — & = & +SP) (10.174) 
ipZ(p) + & 

G(p) = =P 10.175 

p) ipZ(p) — é; + W(py] 

F(p) Wip) (10.176) 


PZP - E + Woy) 


These equations are similar to those for the BCS theory (10.58). The present theory requires 
that the functions W(p), Z(p), and S,(p) are found self-consistently. 

Eliashberg examined the retarded function Re[S,(p, )| in great detail. He showed that it 
was a constant, independent of (p,@), which therefore just renormalizes the chemical 
potential. This conclusion should not be surprising. The earlier investigation in Sec. 7.4 of the 
electron-phonon properties of normal metals showed that Re[X(p, œ)] was an asymmetric 
function of frequency, so that the symmetric part of the self-energy is small. This conclusion 
does not change in the superconductor. Neglect S.,. 

The other functions of interest Z(p) and W(p) are both functions of (p, ip). The wave 
vector dependence, for s-wave superconductors, is unimportant, since everything happens at 
the Fermi surface p ~ kp. It is the energy dependence of these functions which provides the 
interesting phenomena. Actual crystalline superconductors have anisotropic Fermi surfaces, 
which cause the functions to vary with angle around the Fermi surface. This dependence is 
neglected, and isotropy is assumed. In d-wave or p-wave superconductors the angular 
dependence of the order parameter is also important around the Fermi surface. 

For s-wave superconductors the notation is altered to suppress the dependence on wave 
vector. These functions are written as S,(ip), W(ip), Z(ip), etc. The main dependence of 4(p) 
and F (p) on p is through the factor ¢, ~ §,. All the wave vector dependence enters through 
the integrals of the Green’s functions: 


> pal a Pte tin 
| se ip„) = | 4% EIME JZW? 
” de Foipj=-| qa- O -VO 
| LF (P, ip,) = | a, E +W +O, VZ + W2 


These integrals are evaluated by contour integration, say by closing the contour in the upper 
half plane. 
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The earliest work on superconductivity converted these gap equations to real frequency 
and solved them as integral equations over the energy and frequency. That process is 
cumbersome, since each retarded function (W, Z) has a real and imaginary part. An alternate 
procedure was done by Vidberg and Serene (1977). They solved the gap equations as a 
function of complex frequency. The advantage is that all functions are real when expressed as 
a function of complex frequency. The solution to the self-consistent gap equation is much 
easier using only real functions. The hard part of the numerical work is then the analytical 
continuation of the numerical results to the real frequency axis. Since p, = (2n + 1)mkpT, the 
relevent information is (2n +1). Label these functions as W(ip,) = W(2n+ 1), S(ip,) = 
S(2n + 1), etc. The end result of the computer solution is a pair of vectors representing these 
two functions. The best way to convert them to the axis of real frequency is to use Padé 
approximants (Baker, 1975). Vidberg and Serene showed this method gives excellent results. 

The goal is to evaluate the gap Eqns. (10.166) using the Green’s functions in (10.176). 
These integral equations will be reduced to a single independent variable, which is the 
imaginary frequency. The first step is to do all the wave vector integrals, whose variables are 


E = |a |a ag (10.178) 


where v = cos(0) is the angle between p and q. Change the integration variable to 
p' = JP + ¢@ + 2pqv, or p'dp' = pqdv, and as a second step change p'dp' = md¥: 


m [ 2n l 
E ="| qaq | ax | do => |q | ae (10.179) 
P Jo 0 Up 

The q integral is over the Brillouin zone; the inclusion of Umklapp processes extends the 
integration to infinity. The above change of variables restricts |p — p'| < q < |p +p’|. Since 
both (p, p’) are near to kp, the wave vector integral is restricted to 0 < q < 2kp. The d& 
integral will be limited to values of |& | < @p at the Fermi surface, since the integral 
converges rapidly outside of this range. With this change of variables, the wave vector 
integrals are quite simple to do, at least in a formal sense. The integral over d&’ was already 
performed in (10.177). The remaining integration over the coordinates d*q provides a 
constant u, for the electron—electron interaction, and the exact definition of the function 
«°F (w) for the phonon term. The latter was defined in Sec. 6.4 


1 [Pa % _ @ ("dq 
= ao | iam ate) = ahr |, ae (10.180) 
2 l d’q - 2 
PFO) = r | Sos LA So -A (10.181) 


After performing these wave vector integrations, the self-consistent equations (10.166) for the 
two self-energy functions have been reduced to manageable proportions: 


„i (Pn + 4m)Z(ip + iq) oF (w) 
S(ip,) = FE (os +e .) 


B ian (py + dn) Z? + Wip + iq 
W(ip,) =~ 3 _ Witi _ (2 [odo FO) _ ne) 


: 2 2 
P itm J (pn + an)?Z? + Whip + ia} O Tan 


The Coulomb term p, is a dimensionless constant, which acts to reduce the gap function. 


(10.182) 
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The formulas (10.182) for S(ip,,) and W(ip,) can be simplified. First change the notation 
to S(Qn+ 1), W(2n+ 1). Secondly, introduce the gap function A(2n+ 1) and phonon 
interaction V(m) 


A(Qn +1) = tone (10.183) 
V(m) = 2| ade ae (10.184) 


Note that V(0) =A, where here À is the dimensionless strength of the electron-phonon 
interaction. Also note that V(m) < à for m Æ 0. The coupled gap equations are, after chan- 
ging summation variable to k,, =p, + 4m 


S(2n + 1) = - FL AQm + 1)[V(n — m) — p,] (10.185) 
A(2m ++ 1) = (10.186) 
J (Ky) + AQm + 17° 
Z(2n + DAQ + 1) = 3 D(2m + DIV (n — m) — p] (10.187) 
Dm + 1) = hem) (10.188) 


V Kn)” + A(2m + 1° 


Examine the parity of these functions. The phonon interaction V(—m) = V(m) is an even 
function of its argument. The discrete frequency is odd: k, = (2n + 1)nkgT changes sign if 
(2n + 1)—> —(2n +1). This feature makes A(2n+1) an odd function 
(A(—2n — 1) = —A(2n + 1)), which also makes S(2n + 1) an odd function. This term is 
identified as S(2n + 1) = ip„So(ip„), so the factor of Z is 


Z(2n+1)=1 +5 DAG + DIV (n — m) — p] (10.189) 
= 1 + se Mam + 1)V(n — m) (10.190) 


where the term in u, vanishes when one sums over positive and negative values of m due to 
the oddness of A. The above formula (10.190) can be used to show that Z(2n + 1) is an even 
function of its argument (i.e., change m = —m' — 1, n = —n' — 1). 

The remaining issue is the symmetry of the gap function A(2n + 1) as defined by Eq. 
(10.187). Since Z and Vare both even functions, then the gap function can be either pure even 
or pure odd. A gap function which is odd in frequency has the feature that it must vanish at 
zero frequency A(ip, = 0) = 0. In this case it is not superconducting. The physical solution 
has the gap an even function of frequency. The two equations to be solved are recollected 
here: 


ZQn +1) = 1+ E AQm + DVn — m) 


2n+1 
Z(2n + DAN + 1) = nkgT Y O2m + DIV (n — m) — u] 


(10.191) 


They are called “the gap equations”. 
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FIGURE 10.21 The real and imaginary parts A, and A, of the complex gap function as a function of energy œ for 
lead at T = 0. Source: Shaw and Swihart (1968) (used with permission). 


These two equations must be solved self-consistently for the two real functions 
A(2n + 1) and Z(2n + 1). The inputs for the calculation are the kernels V(m), which depends 
on the phonon properties through «*F’, and the Coulomb factor u,. Values of the Coulomb 
constant p, for actual superconductors are about 0.1-0.2. The temperature 7 enters explicitly 
in the prefactor T in the equation for A. It also enters into the function k,, which is found in A, 
@, and V 

The first solution to the gap function, using real frequency, was by Shaw and Swihart 
(1968) for lead at T = 0 as shown in Fig. 10.21. They used the «?F of McMillan and Rowell 
(1965), which is shown in Fig. 7.13(a). The real part of the gap function Re[A(@ = 0)] = A, 
equals the value 1.34meV found from electron tunneling. The imaginary part of the gap 
function Im[A(w)] = A, is zero for œ < Ap. Both real and imaginary parts of A(œ) are 
obviously strong functions of œ. Shaw and Swihart also calculated the infrared absorption and 
showed excellent agreement with the results of Palmer and Tinkham. The electron tunneling 
data for «*F, along with the strong coupling theory, provides an excellent description of other 
experiments such as infrared absorption. The later calculations of Vidberg and Serene (1977) 
using the method of imaginary frequency and Padé approximants, obtained the same result 
for Pb. 


10.4. TRANSITION TEMPERATURE 


The transition temperature of a superconductor is called 7,. It is one of the easier 
quantities to calculate. A simple method was suggested by Owen and Scalapino (1971). It has 
been adopted widely, and is described here. The trick is to retain the imaginary frequency 
throughout the calculation. One advantage of using imaginary frequency is that all of the 
quantities are real functions, which simplifies the computation. A second advantage of the 
method is that it is relatively easy to include embellishments such as the influence on T, of 
impurity scattering, electron—electron interactions, etc. Here a calculation is presented for the 
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simple case of an isotropic s-wave superconductor. The extension to more complex systems is 
described by Allen and Mitrović (1982). 

For temperatures larger than T, the superconducting energy gap is zero. At temperatures 
equal to T, the gap starts to have nonzero value. The transition temperature is the place where 
the energy gap is infinitesimally small. The value of T, is found by setting A = 0 in all of the 
denominators of (10.191) 


l 
Z(2n+1)=1 +g g g2 sentam + 1)V(n — m) (10.192) 


Zn + DAQa + 1) = phen 


where sgn(x) has the sign of x. The equation for Z(2n + 1) no longer depends upon the gap 
A(2n + 1). It can be evaluated at the start of the calculation. Recall that Z comes from the self- 
energy of the electron. At T, the gap is zero so one can use the self-energy of the normal state. 

The function Z(2n + 1) now has a simple form. This fact is demonstrated by evaluating 
it for small values of (2n + 1). In the summation over m on the right, the terms are presented 
in two brackets: the first is when (2m + 1) has positive values as m = 0, 1, 2, etc. The second 
is when 2m + 1 has negative values as m = —1, —2, —3 etc. Note that V(—m) = V(m) is 
symmetric. Examine the four cases that n = 0, —1, 1, —2 


Z(1) = 1+ {{V(0) + V1) + V(2)+---]-[V) + V2) +-- J} 
=14+V(0)=1+A 
Z(—1) = 14+ (-1{(V(1) + V2) +. V3) + ---]-—[VO) + V0) 4+ V2)4---}} 
= 1 + V0) =Z(1) 
Z(3)=1 +H + VO) +O +---] —[V2)+ V3)+---} 
Z(3) = 1+4[V(0) + 2V (1) 
Z(—3) = 14+ 4[V(0) + 2V(1)] = Z(3) 


1 n 


The renormalization factor Z(2n + 1) is an series with 2n + 1 terms. Note that V(0) =A, 
where à is the dimensionless electron-phonon coupling constant. The renormalization 
function is an even function of imaginary frequency. 

Now examine the gap equation (10.193). The gap A(2n + 1) is infinitesimally small. The 
gap equation is a linear matrix equation. The unknown quantity A(2n + 1) is a vector. The gap 
equation is actually an eigenvalue equation. The solution to this equation is obtained by 
setting the determinant of the matrix equal to zero 


det|M,,,| = 0 (10.194) 


V(n—m)—w 
Mn = OnmZ(2n + 1) — — 10.195 
(2n + 1) |2m + 1| l ) 
This “eigenvalue” equation is unusual, since there is no “eigenvalue”. The temperature T is 
contained in the factor V(m) which enters into Z and M_,,,. This temperature is varied until one 
finds the highest value at which the determinant vanishes. This temperature is then T.. In 
practice the determinant does not have to have a large dimension. For À > 1 then its 
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dimension need only be about 10(—9 < 2n + 1 < +9) for accurate answers. This dimension 
of determinent is easy for present computers. The solution is a symmetric function of 
frequency, in that A(—(2n+ 1)) = A(2n + 1). Knowing this fact, the dimension of the 
matrices can be reduced to half. 

The Einstein model provides a simple example. Then all of the phonons have the same 
frequency œp and &?F(@) = ®pA5(@ — @p)/2. In this case it is useful to introduce the 
dimensionless constant b = 2rkgT /(A@,). The gap equation becomes a search for the largest 
value of b which satisfies the equation 


À 


A(2n + 2m + 1) 
A(2 1)Z(2 l r 10.197 
(Qn + D)ZQn +1) =) dob Al + bm] ( ) 
À 
1) = 10.198 
Z(2n+1)= 1+ Z i + ewer ( ) 


The gap equation (10.197) for the Einstein model of phonons is easily solved to give a graph 
of b vs. à, which is shown in Fig. 10.22. Above A > 1 the curve rises slowly with increasing 
coupling strength à. In fact the dependence goes as b > VA. There is no saturation of 
b = 2nkgT./A@® , which means no saturation of T,. The higher is A then the higher is T,. 
The asymtotic limit of large à can be found in a simple way. Assuming that b becomes 
very large, then V(m) becomes increasingly small for modest values of m. In the limit of large 
b retain only the values V(0) = A and V(1) = A/(1 + b?). The gap equation can be solved by 
using only a matrix of dimension two. It is necessary to retain only the gap components A(1) 
and A(—1), and the renormalization parameter Z(1) = Z(—1) = 1 +A. When u, = 0 the 
equations are | 


A(—1) 
AONI +2) = 240) +5 ae (10.199) 
ACDA +4) = faco + 4 (10.200) 


The matrix equation is 2 x 2. Setting the determinent equal to zero gives two solutions for 
b(X). The only one for real values of b is 


b> =1~A-1 (10.201) 
ho 

kT. =—2VA-1 10.202 

B C 2m ( ) 


Keep in mind that this solution is approximate and only valid in the limit that à > ov. 
However, this solution clearly shows that T, increase as the square root of the electron- 
phonon coupling constant at large values of the coupling constant. The exact asymptotic limit 
as A —> 00 is kT, > 0.15% Vd (Allen and Mitrović, 1982). 

The electron—electron parameter u, plays the same role as V(m) for the electron-phonon 
interaction. Combining these two interactions gives a gap equation with an effective inter- 
action of (V — u,). For A > u, the interaction is attractive for small values of m but becomes 
negative for larger values of m. The electron-phonon interaction is attractive only over a 
limited range of imaginary frequency. 

Figure 10.22(a) shows the solution of the gap equation for the Einstein model assuming 
that u, = 0.1. The solid line has u, = 0 compared to the dashed line with u, = 0.1. Electron- 
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FIGURE 10.22 (a) The solution to the gap equations is given as b(A). The solid line has p, = 0, while the dashed 
line has u, = 0.1. (b) The eigenvectors A(2n + 1) for the two cases. 


electron interactions tend to reduce the value of b = 2nkgT./h@y. Another interesting 
function are the eigenvectors A(2n + 1). Figure 10.22(b) shows the relative values of this 
function for the case that 1 = 1. The two curves are the cases that u, = 0.1 and 0. Since the 
gap function is symmetric in (2n + 1) only positive values are shown. Of course, the gap 
function vanishes at T.. Here we assume the temperature is approaching T, and the gap 
functions are very small. In this case only their relative size matters. Note that the two curves 
are very similar near to small values of frequency (2n + 1). However, they differ markedly at 
large frequency. For the case of no electron—electron interaction (p, = 0), the relative values 
of A(2n + 1) get very small and vanish at large (2n + 1) which justifies the use of matrices of 
relatively small dimension in solving the gap equation. 

The opposite behavior is found when u, 40. At large values of (2n +1), the gap 
function A(2n + 1) does not go to zero but goes to a constant. Then there is no argument for 
using a determinent of small dimension. The problem is that the effective interaction in the 
gap equation V(m) — u, does not vanish at large m but goes to a constant — u.. Therefore one 
should use large determinents. However, there is a method of using determinents of smaller 
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dimension, which entails rescaling the Coulomb coefficient. This procedure is described in 
the review by Allen and Mitrovic (1982). 


Problems 
1. Consider the mutual scattering of a fermion and a boson, i.e., a >He and a tHe particle moving 
parallel in time. Use Green’s functions to derive the occupation number factors which enter the 


scattering integral. 


2. Calculate how the pole in the Cooper instability varies with the center of mass momentum P of the 
pair. 


3. What is the momentum distribution n, of electrons in the superconductor at T = 0? 
4. The entropy per unit volume of a superconductor in the BCS model is 
2k 
S=- S rE) In[n-(E,)] + [1 — nr (E) nfl — nr (Er) 


Use this expression to derive the specific heat of the superconductor. Then use A(T) ~ ~T, — T to show 
that the specific heat is discontinuous at the critical temperature. 


5. Write the Hamiltonian (10.35) in the following form: 
i t 
Her = 2 EkCkoCko — 2 A(k)(Ch, C ky + C-k, Cay) 


A(k) = — >) V(qyiC_x_ay Ck+qt?) 
q 


Solve the effective Hamiltonian H,ț by a canonical transformation, and reduce it to the form 


Hott = Eo -+ O + BIB.) (10.203) 
Cyr = cos(O,)o%, + sin(®,) By, (10.204) 
C_p, = cos(6)B, — sin(O,,)o (10.205) 


6. Calculate the Pauli spin susceptibility of the BCS superconducting state. Show that the formula at 
nonzero temperatures is 


1 = 2462 (4 /dE Dnr (E) 


7. Derive the formula for the coefficient in the Josephson current Js(0) = (tA/2)(0)/e) by doing the 
integral in (10.121) for eV = 0. 


8. Evaluate J,(eV) in (10.127) at zero temperature by making the change kgT ip — | dp/(2n) and 
then the analytical continuation iœ —> eV. Define A(eV) at T =0 by using J,(eV) = (0) /e)A(eV). 
Show that: 

1/2 


2 
(a) AeV =0)=A,Kq): q={1- As 
A7 


where A, and A, are the smaller and larger of the two energy gaps. 
(b) When A, = A, = A, then A(eV) = AK(eV/2A). 
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9. Use strong coupling theory to derive a formula for the coefficient J;(eV) for the Josephson 
tunneling. 


10. Derive the strong coupling theory formula for the infrared absorption of a dirty superconductor. 


11. At large values of À the gap equation for u, = 0 goes as b > æo vA where the best numerical 
solution gives a% = 0.1481 (Mahan, 1997). Our first estimate solved the gap equation retaining only 
A(+1), and gave %) = 1. Make a better estimate of this constant by solving the gap equation while 
retaining A(+1), A(+3). (Hint: one cannot solve for b analytically. But letting & = 2/b? so that 
V(m) = &/m? gives a simple equation for €.) 


12. How does the electron—electron parameter u, change the result in (10.202)? 


13. For impurity scattering, take the self-energy of an electron to be given in terms of a T matrix 


X(p, i =n, [|< IT, p (10.206) 
Po Pn? = Vane Peta ip, — epg 
This interaction does not change the energy of the electron, but does change its wave vector. Use this 
function to calculate a parameter 


dP 
Yi = Jre [Tp pq! (E — Ep+q) (10.207) 


which enters into the gap equation and also into Z. Show that these two contributions cancel in the gap 
equation for an isotropic superconductor. In this case, impurity scattering does not change the value of 
T.. This was first shown by P.W. Anderson. 


14. Use the Thomas—Fermi model for the dielectric function to evaluate u, for the homogeneous 
electron gas. Give numerical estimates for r, = 2, 3, 4. 


15. Repeat the above calculation for the two dimensional electron gas, where the electron dispersion is 
parabolic and the Fermi surface is a circle. The major change is in the dielectric screening function. 


Chapter 11 
Superfluids 


Helium has two common isotopes, *He and *He. Each isotope can be separated and a liquid 
formed at low temperatures which is nearly pure "He or pure *He. Each has unusual prop- 
erties and displays collective behavior of a unique character. The boson liquid He shows a 
phase transition at T, = 2.172K to a superfluid state which is similar to Bose-Einstein 
condensation, although vastly modified by the strong interparticle interactions. Similarly, the 
fermion liquid *He develops a Fermi distribution at low temperature, and the particles have a 
superfluid transition which is similar to the superconducting transition in a metal. Of course, 
now it is occurring in a liquid, of electrically neutral atoms, so there is no Meissner effect, but 
there is pairing. However, it also has a unique character, since the atoms avoid the usual 
singlet pairing common to metals and instead pair with the spins aligned parallel. The triplet 
pairing, in turn, leads to many new phenomena and a richer phase diagram, which was 
discovered by Osheroff et al. (1972). The third superfluid in this chapter is the two-dimen- 
sional electron gas under strong magnetic field, which shows the quantum Hall effect and the 
fractional quantum Hall effect. It is also a highly correlated fluid. 

Quantum fluids is a subject where Green’s functions are often abandoned in favor of 
other techniques. There are some calculations for which other techniques are better and for 
which the Green’s function method gives awful results. One example is the discussion of the 
ground state properties of quantum liquids. Green’s function techniques have been unsuc- 
cessful in the study of the ground state properties of classical fluids, and other techniques are 
now used for these high-density, strongly interacting, and highly correlated systems (see 
Percus, 1964). It is understandable that the ground state of quantum liquids is similarly 
difficult to treat with Green’s functions. The description of the ground state properties of the 
system will require techniques beyond those introduced here, and often these results only 
summarize. However, when discussing the excitation spectra of the liquid, which are crucial 
for their superfluid properties, familiar techniques are used, such as Green’s functions or 
perturbation theory. 


11.1. LIQUID tHe 
The first discussion of tHe uses pairing theory. The important approximation in pairing 
theory is that the particles’ interactions can be approximated as a summation of pairwise 


events. Of course, the potentials between particles are a summation of pair interactions: 
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> V(r; - r;). But pairing theory makes a much stronger assumption: when one particle 
interacts with another, it is not simultaneously interacting with other particles. Pairing theory 
would be a good approximation in a gas of low density, where the particles collide occa- 
sionally, and most collision events are binary. But this assumption is not obviously valid in a 
liquid. In fact, it is now known that the assumption is terrible and that pairing theory is a very 
bad approximation when applied to liquid helium. It makes numerical predictions of 
measurable quantities, such as the speed of sound in tHe, which are a factor of 10 higher than 
experiment. One should not regard the pairing theory as a serious theory of liquid helium. 

There are still several reasons for introducing pairing theory. Historically it played an 
important role in the history of many-particle physics. The major ideas were introduced by 
Bogoliubov (1947), who proposed the idea that the particles interacted in pairs. Although this 
theory was unsuccessful when applied to helium, virtually the same theory works for 
superconductivity. The BCS theory is just the Bogoliubov theory, changing the operators from 
bosons to fermions. Of course, there it works since the particles do have pairwise interactions 
and the interactions are weak. In superfluid helium it does not work, although much effort was 
expended over a period of years by many investigators before this conclusion became 
obvious. 

Another reason for introducing pairing theory is that it contains many ideas which are 
qualitatively correct such as the condensate. It provides a good introduction to the subject. 
Later it is shown how the quantum liquid properties are actually calculated using correlated 
basis functions. 

The starting point for this discussion is a Hamiltonian which contains the kinetic energy 
of the particles and also the pairwise potential V(r) 


2 

Pj 
H= J 41D y(y. —y. 11.1 
E HEVE =) (1.1) 


The helium particles are treated as spherically symmetric objects, and the electronic excita- 
tions are ignored. This approximation is accurate, since the atoms have a kinetic energy of 
roughly 15 K, using temperature units, while the electronic excitations have a minimum of 
20 eV. The *He atom is considered to be entirely structureless and is represented as a single- 
boson particle with a potential V(r) when interacting with other similar bosons. The *He atom 
has additional degrees of freedom associated with its nuclear spin. The spin is an important 
property, since it is what makes *He have its fermion character. The helium—helium potential 
which was used most frequently in the early days of physics was a Lennard-Jones potential, 


vin =a (°)"-()] ana 


e = 1.484 x 10°" J (11.3) 
o = 2.648 Å (11.4) 


which is often called a 6-12 potential. The parameter € ~ 10K is the maximum well depth, 
and o is the hard core radius, where the potential rises steeply. These parameters were 
obtained by fitting various experimental results to this potential function (see Hirschfelder et 
al., 1954). It should be appreciated that this potential form was chosen for its mathematical 
simplicity and not because it was a good approximation to the actual potential shape. 
Certainly the long-range attractive potential behaves as r~° from van der Waals forces, but the 
repulsive part is probably not r~'*. Recent efforts to deduce this potential function have 
produced better versions. 
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In the pairing theory the Hamiltonian is written H = Họ + V, where Hp is the kinetic 
energy term. All operators are expanded in the basis set of Hj, which are plane-wave states. 
Rewrite the Hamiltonian (11.1) as 


1 
H = 5 £, C}C, + =È VAC Cy rg Ck (11.5) 
p qkk’ 
2 
b= 5-H (11.6) 


An additional spin index should be added when describing *He. The effective interaction 
V(q) is the Fourier transform of the interparticle potential V(r). One immediate problem is 
that the Fourier transform may not exist. Certainly there is no Fourier transform for a 
Lennard-Jones potential. It is shown later that the potential can be replaced by a 7 matrix, 
which always exists regardless of the potential strength or shape. This divergence is formally 
eliminated until the T matrix is introduced by assuming the divergence is cut off by a 
parameter g and by later letting g — oo. The nonexistence of V(q) is not really a serious 
problem, since the replacement of scattering properties by a T matrix is formally exact for 
binary collisions. This theory would give a good description of a dilute gas. 


11.1.1. Hartree and Exchange 


Liquid helium is not a weakly interacting system, but it is treated as such in pairing 
theory. It is assumed, for the present discussion, that the potential terms are weak. The self- 
energy is found from the first terms which occur in the S-matrix expansion. The first two 
contributions are identical to the terms found for the homogeneous electron gas, which has 
the same formal kind of Hamiltonian as (11.5). They are the Hartree and exchange energies: 


EM) = nV (a = 0) +~ T7(Ep eq) A (11.7) 
q 


The + signs refer to *He(+) and *He(—), respectively. The *He case is identical to that of the 
electron gas, where the fermion nature makes the exchange energy have the opposite sign as 
the Hartree energy. The fermions of like spin wish to avoid each other. For *He, its boson 
character makes the opposite occur: the quantum nature of the particles tends to make them 
prefer to collect at the same spot, so that the exchange energy is positive. For a system with 
only two spinless bosons, the two-particle wave function must be symmetric under an 
interchange of coordinates, which can be achieved with the choice 


Pix (ti, r3) = lhu Eok (2) + by CDO (ty) (11.8) 


prr) = — = e™ (11.9) 


y 


a= 
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The expectation value of the potential energy for this wave function is 


V) = E nEn MEn) [Onder mn) PVE — r2) (11.10) 


1 i(k, —k,)°(r,—r 
= mE, mE) | Pry Proll + ett] (r — r) 
152 


=$ Dalby, MEA = 0) + Hk, — ko) (11.11) 


In the integral in the middle equation, the integration variables were changed to center of mass 
coordinates R = (r, + r,)/2 and r = (r; — rp), and JaR = v., The remaining integral over 
d?r produces V(q = 0) in the first term and V(k, — k,) in the second. The expression for (V) 
has two terms, which are the Hartree and exchange energies. They have a relative plus sign, 
whose origin is the relative plus sign between the two terms in the two-particle wave function 
(11.8). The two-particle fermion wave function, when the spins are parallel, has a relative 
minus sign, which causes the negative sign in front of the exchange energy. The occupation 
factors n(E,) are either ng for *He or np for *He particles. The subscript is omitted in the 
formulas, since they can apply to either case equally. The diagrams for Hartree and exchange 
are shown in Figs. 11.1(a) and 11.1(b). 

The next step is to improve the Hartree and exchange energies by replacing the potential 
V(q) by the equivalent T7-matrix result. The Hartree energy is the potential energy of a particle 
in k, from the interaction with the other particles in the state k, by the q = 0 interaction 
V(q = 0). This interaction can be viewed as a process whereby the two particles interact by 
mutually scattering. Go to center of mass coordinates: k = (k; — k,)/2, K = (k, + k,). The 
mutual scattering of two particles, which interact by central forces, does not alter the center of 
mass wave vector K. It is unaffected by the scattering process, so that the scattering can be 
viewed in a rest frame where K = 0. This scattering is shown in Fig. 11.1(c). The two 
particles approach with k and —k, scatter with a q = 0 interaction, and depart with their same 
wave vectors. In the center of mass transformation, the labeling convention is that the particle 


O* 
190 Teg k, 


ki k k, 


(a) (b) 


FIGURE 11.1 
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k is the one whose self-energy is being calculated, while —k is the other particle from which 
the first is scattering. The particle line of —k is part of a closed loop. 

The scattering of two particles is described by a T matrix Tw, which was introduced in 
Sec. 4.1. The choice of T matrix, rather than reaction matrix, is based on wanting retarded 
self-energies, since the T matrix has the proper dependence on ið in the energy denominators. 
Recall that the on-shell T matrix 7,,, applies only for elastic scattering, when the particle 
energy was € = Ép = p. The kinetic energy €, is conserved in relative coordinates, so this 
requirement is satisfied. The retarded self-energy for particles of energy €, in the Hartree 
approximation should be 


] 
Lak, 6k) = 5 2x, )T (k, —k,)/2,(k, —k;)/2 (11.12) 
2 


In the scattering from other particles, whose wave vector is k,, the scattering rate depends on 
the difference k = (k, — k,)/2 rather than on q = 0. The forward scattering with k’ = k is 
easy to obtain from Sec. 4.1, 


Tk = -ÍT Se + 1)e* sin[8,(k)] (11.13) 
2uk l 


where y is the reduced mass. By changing integration to k = (k; — k,)/2 and d?k, = 8d°k, 
the Hartree energy is 


16r dek 
Fk, by) = aloes 


(on): nS, gL! + Le sin[S,(k)] 


The same considerations should now be applied to the scattering term for the exchange 
energy. Examine the scattering in the center of mass coordinates, where the incoming 
particles have wave vectors k and —k. This diagram is shown in Fig. 11.1(d). Here the two 
particles exchange momentum 2k, so the final states are still k and —k. The solid line 
indicates the particle lines which are connected to make the self-energy diagram for 
exchange. This term has a T-matrix expression: 


1 
x(k), ék) = v2 Lnr )T (k, —k,)/2,(Ky—k,)/2 


_ tony ae TORN ECI + DC 1c sins] 11.14 


When making the expansion in angular momentum states /, then P,(—1) = (-1Y. The two 
self-energy terms, Hartree and exchange, are either added or subtracted, respectively, for 
bosons and fermions. These two self-energy expressions are added for *He: 


E(k, &) = ——5 [ KAK 21+ Ne sinB) | dQym(Ex ov) 
un even / 

For tHe, the two terms combine to eliminate angular momentum components / which are odd. 

Only the phase shifts (k) for l even need to be computed, and the others are irrelevant. 
For *He, there is exchange scattering between two particles only when the spins are 

parallel, and then the scattering rate has no dependence upon the angular momentum terms 

with / even but contains only terms with / being odd. Since there is no exchange scattering 

between particles with antiparallel spin, the energy of a particle contains an average over 
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events with spin parallel and antiparallel. For the self-energy for a particle in *He then 
X(k) = 22, — È.. 


11.1.2. Bogoliubov Theory of tHe 


The Bogoliubov (1947) theory describes Bose-Einstein condensation in a weakly 
interacting system. It is basically a form of pairing theory for the interacting bosons. The 
Hamiltonian (11.5) for a system of interacting bosons is expressed in the plane-wave 
representation. The main feature of the theory is the assumption that a Bose-Einstein 
condensation occurs at a nonzero temperature 7,. The condensation occurs because the 
chemical potential u goes to zero, and there is a macroscopic occupation of the zero- 
momentum state. The collection of particles in the zero-momentum state is called the 
condensate. 

The *He particles are treated as bosons. Because the number of particles is fixed, the 
number operator 


Np(Ok) = — (11.15) 
dk 
=|< í 5 Np(M,) (11.16) 


contains a chemical potential u. It is usually temperature dependent and varies to satisfy the 
implied definition in (11.16) that the integral over the occupation number gives the total 
number of particles N. The particle energy @, is yet to be determined, self-consistently, but it 
is positive and the zero of energy is set at @,_) = 0. With these conventions, the chemical 
potential for most boson systems is negative, so that the energy denominator in n;(@,) is 
never zero because @, > p. However, in Bose—Einstein condensation, the chemical potential 
ut vanishes, as illustrated in Fig. 11.2. The occupation number n,(@,) diverges as k — 0, and 
the number of particles in the k = 0 states is not well defined by this limit. Instead, assume 
that the number of particles in the state k = 0 is some number Nọ which is a finite fraction of 
the particle number N. Indeed, for a weakly interacting system, the condensate fraction 
Jo = Np /N approachs a number near unity as the temperature is lowered below T: he For 
T < T,, the particle number operator is expressed as 


np(@,) = Ny3(k) + (11.17) 


l 
eB — 1 
The delta function at k = 0 gives the number of particles in the condensate, and the second 
term applies only for k 4 0. One of the goals of the theory is to calculate this fraction fọ self- 
consistently. 

The creation and destruction operators ci and Cẹą must have special properties for 
k = 0. Normally, for bosons, when there are ną particles in the state k, the creation operator 


TA T 


FIGURE 11.2 Chemical potential as a function of temperature. Note that u becomes zero at T, in Bose-Einstein 
condensation. 
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gives Cing) = Jn, + I|n, + 1). In normal systems, the occupation numbers n, are integers 
on the order of unity. For the sat with k = 0, the average value of (ny) = No ~ 10%. Then 
the operations give 


CÈIN) = VNo + 1INo + 1) © V/NoINo) (11.18) 
ColMo) = VNolNo — 1) © VNolNo) (11.19) 


where the numbers Nọ + 1 ~ Ny when Ny ~ 107°. Both the creation and destruction opera- 
tors have a similar effect upon the system and give the result «No as an effective eigenvalue. 
The number Nọ is sufficiently large that the operators ci and Cp are treated as scalars whose 


value is ./No. 
ChCy = OC} = Mo (11.20) 


The next step is to examine the Hamiltonian and to isolate all terms in which any of the C 
operators has k = 0. These terms are replaced by ./N for both ci and Cy. Since u = 0, write 
the kinetic energy term as 


/ / 
56, CL Cy = Eq No + ye, CIC, = Se, CLC, (11.21) 
k k k 


The prime on the summation means to omit the term with k = 0. The kinetic energy has no 
dependence on the condensate, since the particles have a zero kinetic energy in this state. The 
interesting terms arise from the potential energy: 


i 
7 p> V (Chg Cy ngv Cr (11.22) 


When the condensate fraction f is near unity, the largest term is when k = k’ = q = 0, which 
gives an energy contribution in terms of the condensate density pọ = No/v = fop: 


l 
7; V(O)No = }4vpo V (0) (11.23) 


The next largest terms are those in which only three of the C operators has zero wave vector, 
but none exist. If any three particles have zero wave vectors, then so does the fourth, and that 
term has been counted already. The next largest terms are those in which two of the wave 
vectors are zero and two are not. There are six ways of pairing the four C operators in the 
potential energy term: 


k=k' =0 PE OCC 
k = q = —kK' SPoXqV(q)C,C_ 
g=0=k 15) V(O)Zy Ch Ce 
. (11.24) 
q=0=k >Po V(0)2,.C, Cy 


k=0,k’=-q 4p 9X,V(q)CiC, 
k’=0,k=q Poa VDC} Ca 
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It is customary to stop at this point and to keep only these terms in an effective Hamiltonian: 


/ 


Hy = Pf €x + PoV OCLC. + Spo WOICE + Cally + Cha} (11.25) 


The term pọ (0) is dropped, since this constant term just causes a shift in the chemical 
potential, which is still zero. The effective Hamiltonian given in (11.25) can be diagonalized 
exactly, which is the strongest argument for using it as the effective Hamiltonian. A better 
theory would retain additional terms. There are terms with three C operators with nonzero 
wave vectors, and of course terms with four. The Hamiltonian is written as Hy) + V, where Ho 
is given in (11.25). It contains the terms quadratic in the operators, while V is the other terms 
which contain three or four creation—destruction operators. The terms are separated in this 
manner because Hy is a Hamiltonian which can be solved exactly. The other term V must be 
treated as the interaction, which is evaluated by the usual Green’s function method to give 
additional self-energy corrections. 

The particles in the k = 0 state are the condensate, which is the ground state of the 
system. Quasiparticles with k Æ 0 are excitations, and Họ is the effective Hamiltonian of 
these excitations. It contains combinations of operators such as Cict, which correspond to 
the excitation of two particles from the condensate. Since momentum is conserved, they must 
have k and —k. Similarly, the term with C,,C_, is the destruction of two quasiparticles with k 
and —k when both are returned to the condensate. The terms with CC, correspond to the 
scattering of quasiparticles by the condensate, and there are both direct and exchange 
processes which lead to the terms with V(0) and V(k), respectively. 

The Hamiltonian (11.25) for Hp is solved to obtain the eigenvalues and eigenstates of the 
system. This solution is approximate, since it omits the effects of the potential V. The 
approximation of neglecting V and using only the eigenstates of Họ is the pairing theory of 
Bogoliubov. The Hamiltonian is rather easy to diagonalize. Each wave vector state is treated 
independently, and each of these is just the problem encountered in Problem 3 in Chapter 1. 
Introduce a set of harmonic oscillator coordinates Q, and P,, and the Hamiltonian is an 
example where the interactions merely shift the eigenfrequency of the oscillator: 


| 
Ok = Tee (Cu + Ct) (11.26) 
P, = j% (CË — C) (11.27) 
Ay = Ly {Py Py + Oy. Ox le + 2P9& V(K)]} 

i (11.28) 
Ho = 2 {PPk + OKO; Ox} 
OF = Erlek + 2p) /(k)] (11.29) 


The first two terms in Hy come from ci Cx, while the last term comes from the second term in 
the Hamiltonian Hj. The terms multiplying O,Q_, are combined into the square of an 
effective frequency Qg. The bosons behave as a harmonic oscillator system, for each wave 
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vector k, at the new frequency Qy. The Hamiltonian is diagonalized by defining a new set of 
creation and destruction operators 


/ 
Hy = 3 ONAA (11.30) 


Qk = l (a, + of p) (11.31) 


AJA, 


Q 
P, = ij- (at — ap) (11.32) 


In the pairing theory, the energy , is that of the quasiparticles, and a, are the operators 
which describe the destruction of these quasiparticles. 

The equation for the quasiparticle energy Q, = {€,[&, + 2Po V (k)|} “< makes no sense 
whatever, since the Fourier transform V (k) does not exist. It should be replaced by a T matrix 
or reaction matrix, and the latter is correct. According to Eq. (4.31), for each angular 
momentum state / one can replace the potential by the on-shell reaction matrix 


1/2 


_2n tan[6,(k)] 


V(k) > 4nR,(k, k) = ni 


(11.33) 


where u = m/2 is the reduced mass for the scattering of two identical particles of mass m. 
The reaction matrix has the important feature that it goes to a constant in the limit where 
k — 0. The phase shifts behave in this limit as 


lim õ;(k) = myn — ak?! (11.34) 
tan(0;) _ ] 
lim — = —ak’ (11.35) 


where m, is an integer which denotes the number of bound states in that scattering channel. 
The quantity a is called a scattering length. 

The largest term is from the s-wave channel, in which R,(0,0) = 2ra/u, while 
R,(O, 0) = 0 for / > 0. Only the s-wave needs to be included in the dispersion relation of the 
excitations as k > 0 


2 4 
V(k) > 4 = (11.36) 
u m 
Q; —> V 2E€kPol4ra/m) = ck (11.37) 
h 
——/4 . 
c = v y 4na po (11.38) 


The scattering length a is positive for repulsive potentials. For a hard sphere potential of 
radius ©, then © = a. 

In the earlier days of helium theory, the helium-helium interaction was approximated as a 
hard core at the radius a = o = 0.265nm of the Lennard-Jones potential. All the other 
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parameters are known since py is the He density. The velocity of the excitation may be 
calculated: 


a = 0.265 nm (11.39) 
m = 6.648 x 1077’ kg (11.40) 
Po = 2.2 x 10” atoms/cm? (11.41) 
c = 130m/s (11.42) 


The predicted velocity is close to the speed of ordinary sound in helium, which is 
c, = 220 m/s. This agreement was viewed as a great success of the pairing theory, since the 
collective excitations of the liquid at long wavelength must certainly be the longitudinal sound 
waves. 

Unfortunately, this agreement is entirely superfluous. The interaction potential between 
two He particles has attractive regions, and the phase shifts for this potential show there is a 
low-energy scattering resonance. The exact value of the scattering length a is uncertain 
because it is very sensitive to the potential. The expected values are more in the range of 500 
rather than o. A value of a ~ 500 will make the calculated speed of sound be 4 times larger 
than the experiment. Of course, the sign of a could be negative, in which case the predicted 
frequencies are complex, and the vibrations are unstable. A possible way out of this difficulty 
is to do the theory more carefully. This was done by Brown and Coopersmith (1969), who 
found even worse agreement with experiment. Their calculation represented all direct and 
exchange scattering by reaction matrices, and they self-consistently found the condensate 
fraction fo and the chemical potential. They obtained a speed of sound about 10 times that of 
the experiment and also predicted a roton energy about 10 times too big. Indeed, their 
theoretical excitation spectrum of *He is similar to the actual one but scaled up by a factor of 
10. This calculation was done carefully and correctly. Its lack of success is caused by the 
failure of the pairing approximation. Indeed, the history of the pairing theory is that each time 
the theoretical calculations were done better, the answers got worse. The conclusion is that the 
pairing theory is bad and should be discarded as a way of calculating the excitation spectrum 
of *He. Some of the concepts are retained, in particular that of the condensate. 


11.1.3. Off-Diagonal Long-Range Order 


The modern theory of superfluidity in tHe treats the liquid as a highly correlated and 
strongly interacting system. The pairing theory is completely abandoned, as is the repre- 
sentation of operators in plane-wave basis states. The one concept carried over is the 
condensate, or zero-momentum state, in which there is a nonzero fraction of the particles in 
the superfluid. There is an obvious problem of trying to decide the definition of “zero- 
momentum state” in a basis set which is not plane waves. It is really part of a larger problem 
of trying to decide what is meant by superfluidity and Bose—Einstein condensation in a 
system which is strongly interacting and highly correlated. The whole concept of Bose— 
Einstein condensation is based on particles occupying zero-momentum states, or not occu- 
pying them, and another basis set is needed for a more general definition. The method of 
doing this was introduced by Penrose (1951), and his ideas are described. Yang (1962) 
suggested the name of off-diagonal long-range order (ODLRO) for the type of ordering 
introduced by Penrose and also discussed by Penrose and Onsager (1956). 
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ODLRO was introduced as the type of ordering for superfluids, such as Bose—Einstein 
condensation in tHe, and electron pairs in superconductors. It is distinguished from diagonal 
long-range order (DLRO), which is the usual ordering one finds, for example, in crystalline 
solids. When *He atoms arrange themselves into a solid, as happens under pressure, they are 
exhibiting DLRO. But when they go superfluid, they exhibit ODLRO. The distinction 
between these two processes stems from the different behavior in the density matrix for each 
type of ordering. These differences will be explained. Further discussion is given by Kohn 
and Sherrington (1970). 

The system has N ~ 10% identical bosons, which are spinless. Their ground state is 
described by a many-particle wave function Yo(r;,r»,...,ry). The actual method of 
calculating this wave function, or at least some of its properties, is given in the following 
sections. The present discussion only needs to assume that the wave function exists. The 
subscript zero on Y, indicates it is for the ground state. Note that there is no assumption that 
each atom is in the ground state (1.e., k = 0) but that the system is in the ground state. When 
the particles are strongly interacting and highly correlated, they will spend little time with 
k = 0 and will fluctuate between many momentum states. The ground state is the lowest 
possible energy state for the whole liquid. It is not a static or rigid structure, since each atom 
fluctuates with zero-point energy. The present estimates for *He at T = 1 K give the average 
kinetic energy per particle as 15 K and the average potential energy as —22 K, so the average 
binding energy is 7 K. The large value of average kinetic energy shows the large amount of 
zero-point motion in the fluid, even at low temperature, which comes from the quantum nature 
of the fluid. The classical estimate (K.E.) = 3kgT/2 is obviously inaccurate. The ground state 
wave function V)(r,,1r,...,Iy) for *He describes a system with a large amount of zero- 
point motion. 

The square of the wave function gives the probability density for finding particles at 
positions r, in the system and is called the diagonal density matrix: 


Paiti r2, s Ty) = (Por). r2, n, ry) (11.43) 


The subscript N indicates that it applies to N particles. py is normalized so that the integral 
over all coordinates gives unity: 


l= jar -» Pry Py("},¥o,---, ry) (11.44) 


The one-particle density matrix is obtained from py by integrating over all but one coordi- 
nate: 


pi) = | Prar, Ary pyr, ry,.-.,Ty) (11.45) 


l = [dnp (11.46) 


The notation is a bit confusing, since p,(r,) is not the probability in a one-particle system but 
rather the probability of one particle being at r, in the N-particle system. Since all points are 
equivalent, p,(r,) is really independent of position—neglecting edges—so p; = 1/v, where v 
is the volume of the system. 
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Another useful quantity is the two-particle density matrix p,(r,, r2) which is obtained 
from py by integrating over all but two coordinates: 


P2(r1, r2) = [drar +- Pry py (ty, T3,- -, Ey) (11.47) 


1 
7 = | ron. r2) (11.48) 


Now p,(r,, r2) is the probability that any boson is at r, if there is one at r,. In a homogeneous 
system it must depend only on the difference r; — r,. In fact, it must be proportional to the 
pair distribution function g(r), and this relationship is 


l 
P2(r1, r2) = z8r: — r3) (11.49) 


which follows from the fact that g(r) is normalized to be unity at large r. 

Crystalline solids have DLRO, which is indicated by structure in g(r). Since the atoms 
are regularly spaced, g(r) has large values where the atoms are located and is zero otherwise. 
Thermal vibrations smear out the answer. 

Penrose (1951) used the idea of a general density matrix which is defined as the product 
of two wave functions with different coordinates: 


Py. r2,- Fy; r1, Y2,- Fy) (11.50) 
= P(r, rs, ..., ty) Vo, r, ..., ry) (11.51) 


This quantity is denoted by p, where the tilde is to distinguish it from the diagonal density 
matrix introduced earlier. Of course, they are identical if the two sets of coordinates in p are 
set equal: Py (r4, r2, ..., Fry; fj, r2,- ., ry) = P(T], r2, .. ., ry). The concept of ODLRO is 
contained in the function p(r,, r4) obtained from py when all but one set of coordinates are 
set equal and averaged over 


~ / 3 3 3 3. z . ol 
Pir ri) = |a rod r3d ra-e-d rypyt, r2, ..., Yy; rj, r2, ..., Fy) 


1 
pı, ri) = pır) = y (11.52) 


where p,(r;,¥,) becomes the diagonal density matrix p) when r; =r. In a liquid, the 
dependence on r, and r, can only be their difference, since there is no absolute frame of 
reference. It is convenient to define the quantity 


Rr; — r1) = vpi (r1, r4) (11.53) 

R(0)= 1 (11.54) 

which is normalized to unity at r = 0. R(r) is the function which is important in under- 
standing ODLRO. 

Some insight into P(r, r4) is obtained by reconsidering the techniques used in the 


weakly interacting systems. There a one-particle state function is defined in the plane-wave 
representation as 


(r) = 2 eKO, (11.55) 
k 
1 , 
C, = z| Erow (11.56) 
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The number of particles in state k is 
1 . 
nk = (CLC,) = - l Prd’ re OO NOT) (11.57) 


ną is found to be the Fourier transform of the quantity (®'(r)®(r’)), which must be a function 
of r —r’. Recall what this average means. Take a particle and find its wave function at two 
different points r and r’. This product is averaged, which must be taken over the other 
particles and their positions. This procedure is exactly the one which was used to obtain 
P(r, r1), except now it is recognized from the outset that the particle is part of a many- 
particle system. The procedure is to take a product wave function with one particle at two 
points r; and r, and average it over the other particles and their positions. The quantity 
P,(r,;,1) or its equivalent R(r,; — r2), is the many-body definition of the quantity 
(ot (r)®(r’)). It is also important to appreciate that R(r) is quite a different function from the 
pair distribution function g(r) and the two are not related. The quantity ną is also related to the 
Wigner distribution function where n,(R, T) = f dof (k, œ; R, T). 

The quantity n, is the number of particles, on the average, in the momentum state k. It is 
an important quantity in Bose—Einstein condensation, since one expects that one momentum 
state ky will have a macroscopic occupation such that ng, = foN, where fo is still the fraction 
of particles in the condensate. This fraction is of order unity, rather than O(1/N). Normally 
kọ = 0, which applies when the fluid is at rest. There are circumstances when another state 
has the macroscopic occupation, e.g., when the fluid is flowing at a uniform rate or when it is 
rotating. For k Æ Ko, 


d 
n, = [Enok £) (11.58) 


where A(k, £) is the spectral function for the boson particles. 

The procedure for finding ną is the same as in the free-particle case (11.57). There one 
took the Fourier transform of (®'(r)®(r’)), while now one takes the Fourier transform of the 
equivalent quantity R(r — r’). One difficulty with this procedure is that R(r) may not possess a 
Fourier transform. For example, how does it behave as r — oo? For fluids at rest, so that the 
macroscopic occupation is in the zero-momentum state, R(r) goes to a constant as r — ov. 
Call this constant R(co). The Fourier transform is 


Ny, = Po | Pree (11.59) 
= 9)R(0o)(27)5°(r) + Po | d>r[R(r) — R(oo)]e*' (11.60) 
= Nody—o + Po |nr) — R(oo)Je*'T (11.61) 


The density is pọ = N/v, No = foN. There is a delta function term at k = 0 whose amplitude 
is pọR(co). The quantity R(oo) = fo is the fraction of particles in the zero-momentum state. 
This fraction is the “order” which exists in the off-diagonal density matrix R(r). For k Æ 0, 
the momentum distribution of particles ną is found from the Fourier transform of [R(r) — fol. 

Figure 11.3 shows the function R(r) calculated for liquid *He by McMillan (1965) using 
Monte Carlo techniques. He used the Lennard-Jones potential between helium atoms and 
found fọ = 0.11. His fraction is similar to that obtained earlier by Penrose and Onsager, who 
got fọ = 0.08 for a gas of hard spheres. The latter estimate is also based on a Monte Carlo 
calculation. It appears, unfortunately, that reliable results can be obtained in this field only by 
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R(r) 


6 (A) 


FIGURE 11.3 Single-particle density marix as a function of separation. The solid line is the Monte Carlo calculation 
of McMillan (1965), which as asymptotically approaches a density fraction of fp = 0.11. The dashed curve is the 
Gaussian approximation. 


extensive computer calculations. There have been many calculations of these quantities; e.g., 
see Francis et al. (1970). Figure 11.3 shows that R(r) starts at unity and falls smoothly to its 
asymptotic value, which it reaches at about r = 0.4 nm. This distance is rather short, since it 
is only 1.5 atomic diameters. 

The quantity f = R(oo) is the fraction of time a particle spends in the condensate. 
Alternately, it is also the fraction of particles in this state at any one time. However, in 
calculating this quantity, it has not been assumed that the system is superfluid. The value of 
R(co) is found by solving the dynamical properties of normal fluids, which may seem 
paradoxical and is probably best understood in the next section which gives more details of 
the method. Normal fluids do not have a condensate and have f = R(oo) = 0. However, the 
point is that most methods of calculating fọ are based on the properties of normal fluids. For 
example, the estimate of Penrose and Onsager came from the equation of state of a classical 
hard sphere gas, which is certainly not a superfluid system. 


11.1.4. Correlated Basis Functions 


Correlated basis functions (CBFs) are the type of wave function most often employed in 
the study of the ground state properties of *He. They have the form 


Polri, r2, ..., ry) = Ly exp -Zur — J (11.62) 


i>j 


where Ly is a normalization constant which is specified by satisfying the integral (11.44). 
They were used by Bijl (1940), Dingle (1949), and Jastrow (1955) and are sometimes named 
after these authors; different writers preferring various combinations of their names. The 
CBFs have several advantages which make them desirable for a description of *He. The most 
important is that the wave function possesses the necessary symmetry of being symmetric 
under the exchange of any two coordinates. This symmetry is automatically satisfied by the 
summation of pairwise correlations. Another advantage is that Yọ has a simple form for 
which one needs to determine only one function u(r), which is usually found variationally by 
minimizing the ground state energy. 

A third advantage of CBFs is that the diagonal density matrix has a mathematical form 
which is identical to another problem which has been studied extensively—the classical 
fluid. For a classical fluid, the kinetic energy terms are irrelevant for determining particle 
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correlations, and only the potential energy V(r; — r;) between particles i and j is important. 
The density matrices for CBFs and the classical fluid are 


>J 


CBF: pN = |¥,|*? = L} oo] -25u — J (11.63) 
Classical: py(r1, r2, ..., ry) = Q, exp E> Vir; — J (11.64) 
i>j 


These have the same mathematical form when setting 2u(r) = BV (r). The classical problem 
has been extensively studied, and accurate calculational techniques have been developed 
which work well for liquids of neutral atoms such as the rare gases. The most successful 
methods are based on the Percus—Yevick equation (see Percus, 1964). The availability of this 
successful computational technology is one reason for the popularity and success of CBFs. 

One important quantity is the off-diagonal density matrix R(r), which is defined in terms 
of CBFs as 


Rr, — ri) = vLiy | drar, ws dry 


N 
x exp -7 [u(r, — r;) + u(r, — r;)] — 2 5 u(r; — | (11.65) 


j=2 i>j=2 


Let us determine the classical analog of this expression. That is, regard u(r; — r;) as the 
effective “potential” BV(r; — r;)/2 between particles and see where that leads us. Obviously 
there is a system of N — 1 particles which mutually interact with each other with the 
“potential” 2u(r). There is a particle at r} which interacts with these N — 1 particles with the 
different potential u(r). Another particle at r} also interacts with the N — 1 particles with the 
same relative potential u(r), so that it is identical to the particle at r,. Finally, the particles at 
rı and r} have no mutual interaction. This description is of a system of two impurity particles, 
at r; and r;, ina system of N — 1 other particles. The impurities are different, since they have 
a different interaction with the N — 1 particles and no interaction between themselves. This 
classical system can be solved by using the Percus-Yevick equations for binary mixtures, 
where one constituent is very dilute. This technique was used by Francis et al., (1970) for 
their investigation of superfluid *He. It is another example of where classical equations are 
solved to determine the properties of the superfluid. 

The ground state properties which are calculated with CBFs include the condensate 
fraction fọ, the momentum distribution ng, R(r), g(r), and S(k). Another property is the 
ground state energy per atom, which is usually presented as the separate contributions from 
potential energy (P.E.) and kinetic energy (K.E.). The potential energy per atom is given by 


dri- -dry V(r; — ¥,) Py (ty ++: Ty) (11.66) 


The density matrix p,(1)---ry) is unchanged when any pair of coordinates r; and r, are 
interchanged. With this fact, one can show that the average value of each potential energy 
term V(r; — r;) is the same as any other term V(r; — rj). The average potential of any particle 
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is the same as any other. The summation over i and j gives N(N — 1)/2, which is the number 
of pairs, and all pairs contribute equally: 


(P.E.) 
N (N — 1) [dndn — r) | Brad + Pry py (ty ++ ty) 


The integration over all variables except r} and r, produces the reduced density matrix 
.(r,, r2) = g(r; — r2)/v?. Then changing to relative coordinates r = (r; —r,),R= 
(r; + r2)/2, there is no dependence on R, so its integral yields v. The final answer is 


(P.E) Po f 3 

a) fa rV(r)g(r) (11.67) 
The average potential energy is obtained from the pair distribution function. This result is 
obvious for a system with pairwise interaction between particles. It does not depend on CBFs 
but applies to any liquid and is an exact identity. The CBFs are used to calculate g(r), which is 
then used in the evaluation of the average potential energy per particle. The factor of 5 occurs 
because each pair interaction is shared between two atoms. 

The average kinetic energy per particle is obtained from the fundamental definition 


KE) # 
ae) = : [Eri rylV Yolen ty)P (11.68) 
J 


The result for each term j in the summation is identical, so the sum gives N times the result 
for one term: 


KE) K 
a Pr Pry IM Felts r) (11.69) 


The evaluation of this quantity depends sensitively on the use of CBFs. The gradient of the 
wave function produces the gradient acting upon u(r): 


N 
Vi Foti. ..-5 ry) = LNV: oo » u(r; — D| = — Yo 2 Vinny = r;) 


i>j J= 

This expression is squared in the evaluation of the average kinetic energy. At first the 
evaluation appears to involve three-particle correlations. One can integrate all but the three 
coordinates 74, Fj, m» Which produces the reduced density matrix p3(r1, rj, rm). It would be 
difficult to evaluate p, accurately, since the knowledge of three-particle correlations is 
imperfect. However, McMillan (1965) suggested a method of avoiding this problem and 
expressing the result in terms of only two-particle correlations. His method is to write one 
derivative as acting upon the density matrix itself: 


N 
Py 2 Vint = r;) = —3V 1 Pw, f2, -> Tw) 
J= 


, (11.70) 

Á 3 3 

ERLE y [Andya =) Vipalr) 
J 
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All coordinates were integrated over except r, and r;. Eliminate the summation over j since 
each term contributes identically: 


2 
(K.E.) — -(N — pe 
4m 


= l Pridr N ul, r) Vipal, r2) (11.71) 


Again changing integration variables to relative coordinates, the final form of the result is 


2 
(K.E.) h l pp lt) det) (11.72) 


N — 4m dr dr 
The average kinetic energy is expressed as an integral over the derivative of u(r) and g(r). 
Neither quantity has an angular dependence, so the gradient acts only upon the radial variable. 
This form is used for the evaluation of the average kinetic energy per particle. The numerical 
results obtained by different investigators vary somewhat, depending on the details. Generally 
the results show that the average potential energy per particle has the magnitude of 20-22 K, 
while the average kinetic energy per particle is 14-15 K, and the net binding energy per 
particle is 6-7 K. These values depend on the particle density. 

So far the form of u(r) in CBFs has not been specified. That will be done now. Usually 
the contribution of u(r) is divided into a short-range component and a long-range component. 
The short-range component is the most straightforward and is discussed first. One expects that 
the atoms do not interpenetrate. There must be a term in u(r) which keeps them apart, which 
is done by having a term in u(r) which becomes very large as r — 0, which makes 
exp(—u) — 0. The form of this term can be deduced from Schrédinger’s equation and the 
repulsive part of the interparticle potential. Take a two-particle system, so that u(r) describes 
only the effects of correlation between the motion of two particles. In center-of-mass coor- 
dinates, the relative motion of the two helium atoms is described by the following CBF wave 
functions: 


2 
| h $ 2d (l+ 2 4 roje — Eel) (11.73) 


 2uļdr? rdr r? 
h| du du\” 2du (1+1) 
—ur)J_ | 7 a) ea — — 
Í | 2u dr? + (5) r dr r? TV) —E ° (11.74) 


If u(r) is diverging sharply as r —> 0, the dominant term among all the derivatives is (du/ dr)’. 
For example, if u x r~", then (du/dr} œ r~2"~2, while the second derivative diverges only as 
r-"-. The two most divergent terms must cancel, where one is from the kinetic energy and 
the other from the potential energy: 


1/2 
du — -|2 ro] (11.75) 


Students of physics should recognize this result and derivation. It is just the Wentzel- 
Kramers—Brillouin—Jeffreys (WKBJ) wave function for a particle penetrating into a repulsive 
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potential: as a specific example, in the Lennard-Jones potential the repulsive part is 4e(o/r)’”. 
In this case the short-range part of the CBF must be 


du SUE 1/2 19 6 
Sa -(25) (>) (11.76) 
Bue) oS yas 
seot) 
= 11.78 
É o( 25h? (11.78) 


For the potential between two *He atoms, ¢ = 1.484 x 107% J, o = 0.22648 nm, and the 
numerical result is a = 1.010. The constant a is nearly identical to the hard sphere diameter 
o. McMillan (1965) used the Lennard-Jones potential for his variational calculation and 
included no long-range potential. He tried correlations of the form u(r) = (a/r)", where a and 
n were both variational parameters. He used o = 0.2556 nm so a = 1.010 = 0.259 nm is our 
prediction. His minimum energy was found with n = 5 and a = 0.261 nm. These values are 
nearly identical to the results using the WKBJ between two particles. The short-range part of 
the CBF can be obtained from the wave function for a particle penetrating into a repulsive 
potential. This part of u(r) is determined once the potential is specified. 

The long-range part of u(r) is more subtle. There is a term which falls off as u(r) œ r7 
as r — oo. This contribution arises from the zero-point motion of the phonons. Bogoliubov 
and Zubarev (1955) showed, in the pairing theory, that the phonons lead to long-range 
correlations between the motion of particles in the liquid. Reatto and Chester (1967) showed 
that these fluctuations lead to long-range contributions to u(r). They recommended 
u(r) = u,(r) + u,(r), where the short-range result u,(r) = (a /ry’, and the long-range part is 


2 


mc 1 


— ——__—— 11.79 
2n? por? + r? ( ) 


u(r) = 


where the cut-off is r, ~ 0.20 nm. The parameter c is the speed of sound. The function u;(r) 
goes to a constant as r — 0 and only becomes proportional to r~* at large distances. An 
inverse square law is a rather slow fall off with distance and is the long-range behavior. 

The same u(r) was used by Francis et al. (1970) in the numerical study referred to 
previously. The inclusion of the long-range correlations will change R(r) at large r and n, at 
small k. In particular, the distribution ną diverges as k — 0 as a result of the long-range 
correlations. The divergence is shown in Fig. 11.4 where the results for ną of Francis et al. are 
shown for both the short-range correlations only (SR) and the short plus long-range corre- 
lations (LR). Their short-range result is nearly identical to that of McMillan. This agreement 
is expected, since they used the same interparticle potential but a different method of 
obtaining results, i.e., the Percus-Yevick equations for mixtures. The long-range correlations 
have a dramatic effect upon the theoretical values of n,. This low k divergence is found in the 
experiments, as discussed by Griffin (1985), and by Svensson and Sears (1986). 

Amid all these equations, it is worthwhile to gain a physical picture of the particle 
motions on the microscopic level. The average potential energy of 20 to 22 K per atom can be 
achieved by having four neighbors at the minimum well depth of 10 K, so the average 
potential energy per particle is 20 K. However, an inspection of g(r) shows that it peaks at 
larger values of r, and the neighboring shell of atoms is farther away than the minimum well 
depth at 0.3 nm. The potential energy comes from having about eight neighbors, each with an 
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FIGURE 11.4 Theoretical calculations of the momentum distribution of atoms in liquid *He. The points are due to 
McMillan (1965), and the solid line is from Francis et al., who used the Percus-Yevick equation both with and without 
long-range order. Source: Francis et al., (1970) (used with permission). 


average interaction energy of 5 K, which gives the per particle average of 20 K. In a classical 
fluid such as argon, g(r) peaks at the maximum well depth, so the first neighbor atoms are 
sitting at the distance of the maximum potential energy. Helium does not behave this way, 
because the quantum nature of the motion prevents that much localization of the particles, so 
the first neighbors are farther away on the average. 

The average kinetic energy is the more interesting number. The estimates of this quantity 
range from 10 to 15 K. What kinds of particle motion give this large number? There are two 
microscopic pictures which come to mind. The first has the particle motion as part of the zero- 
point motion of the long-wavelength phonons. Here the particles keep their nearest neighbor 
positions rather fixed, but the system executes long-range fluctuations. This Jell-O model has 
the system wiggling like an elastic medium. The amount of energy in this zero-point motion 
can be estimated as 


h 
(Z.P.E.) = Loa $ — q) (11.80) 
N Hy 
N 16n2p, (11.81) 


Here c is the speed of sound and pọ is the particle density. The cutoff wave vector q, is the 
maximum value of the excitation spectrum, which is considered to be a sound wave, i.e., over 
which œ, = cq is an accurate approximation. Francis et al., estimate it to be q, = 0.5 
AT! With this value and the other known parameters for *He, one can estimate that (Z.PE.) = 
0.33 K. The result 0.3 K is a negligible part of the total kinetic energy per particle. This 
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low estimate is confirmed by the calculations of Francis et al., who found that the long-range 
correlations (and fluctuations) changed the average energy per particle only by 0.5 K. 

The average kinetic energy per particle does not come from the long-range fluctuations. 
Instead, it seems to come from the short-range motion. The first peak in g(r) is at 4 A, so the 
atom sits in its own space in the liquid, which is roughly a sphere with a radius of about 2 A. 
A particle in a spherical box of radius a has a kinetic energy of A’n?/2ma?. Using a = 2 Å 
and the mass of helium yields a kinetic energy of 15 K, which is just the right magnitude. The 
kinetic energy comes from the short-range fluctuations of the particle bouncing around inside 
its own small space in the liquid. In the ground state of the liquid, at very low temperatures, 
the individual particles are moving rapidly with this motion. 

It is also useful to have a simple approximation for ną and its Fourier transform R(r). 
Puff and Tenn (1970) observed that neutron scattering experiments, which will be described 
later, showed that the particle distribution was Gaussian. They suggested the form 


Qnh? 
nk = Nooy=0 + (l -mmf mw 


1/2 
) exp(—é€,/w) (11.82) 


One advantage of the Gaussian distribution is that one can calculate quantities easily. For 
example, the average kinetic energy per particle is simply 


(K.E.) 2nh? dk sew 
a = (zE) | oor $ = 3(1—f)w 


From their fits to the neutron scattering data, they deduced that this average kinetic energy is 
about 15 K. It is their result we have mentioned several times previously in this chapter. This 
form has a certain appeal. The classical distribution is also Gaussian, except then w = kpT, 
and the average kinetic energy is (3/2)kgT. In *He there is still a Gaussian distribution, but 
the width w is much larger than predicted by the temperature; one estimates that w= 11 K in 
the limit where T — 0. 

The off-diagonal density matrix R(r) is the Fourier transform of ng. When 1, is Gaus- 
sian, so is R(r): 


Ik 
R(r) = ra (ny nye*'* = fy + (1 —fo) exp(—17/b*) (11.83) 
2 
b = 2h (11.84) 
mw 


This Gaussian is the dashed curve in Fig. 11.3, compared with the calculations of McMillan. 
The fit is reasonable, considering that the parameters were taken from the estimate of Puff and 
Tenn that w = 11 K, which gives b = 1.48A 

It is likely that neither ną (for k Æ 0) nor R(r) are exact Gaussians in the superfluid state 
and that the fit to this functional form is only an approximation. Nevertheless, it is useful to 
have approximate analytical forms, and the Gaussian fits best. Puff and Tenn, in fitting 
the neutron data to a Gaussian plus a delta function at k = 0, estimated that f = 0.06 at 
T =1.27 K. Their estimate is in agreement with later measurements and extrapolates to a 
value of fọ = 10.8% in the limit where T — 0 if one uses the Bose-Einstein extrapolation 
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11.1.5. Experiments on n, 


Feynman first suggested that neutron scattering would be an excellent means of inves- 
tigating the structure of liquid *He. His remarks were aimed at a measurement of the exci- 
tation spectrum. These measurements by neutron scattering have been very successful and are 
described in the next section. However, in the past three decades another series of neutron 
scattering experiments have been performed on liquid *He in order to measure the momentum 
distribution of particles n,. This latter experiment involves using higher-energy neutrons and 
analyzing the data in a fashion different from that used to find the excitation energy. The 
primary objective of this effort is a direct measurement of the condensate fraction fọ. 

Hohenberg and Platzman (1966) made the suggestion that inelastic scattering by very 
energetic neutrons (say 0.1 eV) would provide a measurement of ną and hence the condensate 
fraction fp, which would show up as a delta function on the n, distribution. Of course, the 
delta function would be broadened by experimental resolution, but should still be apparent if 
the estimates of f) = 0.1 were correct. The reason for using very energetic neutrons is that the 
sudden approximation becomes valid when the neutron energy is much higher than the 
kinetic or potential energy of the *He atom. In the sudden approximation, one has a direct 
measurement of n,. 

In the neutron scattering experiment, the neutrons with initial wave vector k; and initial 
energy E; = h? k? /2m are directed toward the scattering chamber. Some neutrons are scattered 
and leave the sample with a final wave vector ky = k; + Q, where Q is transferred to the 
liquid. Similarly, the final energy is Ey = (k; — Q)?/2m = E; — œ, where œ is the energy 
transferred to the liquid. The scattering cross section for *He can be expressed in terms of the 
dynamic liquid structure factor S(Q, @) as 


d?o _ ky 


Jado k SoS (Q, œ) (11.85) 


where 6, is a cross section for neutron scattering from a single alpha particle. Equation 
(11.85) is a general result which is always valid for neutron scattering. Generally a calculation 
of S(Q, œ) is quite complicated. However, in the sudden approximation, it can be approxi- 
mated by the expression 


50,0) = [4 n]o(o-2 2%) a(o 24 2*)] arso 


Po (2n)° k 2m m 2m 


where n, is the momentum distribution of particles in the ground state of the liquid. The 
outline is to first discuss the implications of this result; the derivation is provided later. The 
momentum distribution nę is written as the condensate term Nod, plus the other term which 
is called n,. These two are used in (11.86), and the wave vector integrals are done as far as 


possible (£o = Q*/2m) 


k = NoÒk—0 + Ny (11.87) 
S(Q, ©) = SQ, ) — SQ, —o) (11.88) 
- ek . 
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The angular integral dv = d(cos0) over the delta function equals m/kO if k> 
(m/Q)|® — E9| = kp(@), so the final result is (w > 0) 


- m OO 
S(Q, ©) = 2nfod(@ — £ + | kdkn 11.90 
ASE = 0) + 5505. hii (11.90) 


Equation (11.90) predicts that S(Q, œ) for @>0 has a delta function contribution at the 
energy loss œ = £ọ = Q’/2m. The delta function corresponds to the neutron knocking 
particles out of the condensate, and the fraction of the spectral strength in this process is fy. Of 
course, this delta function is broadened by the resolution of the measuring apparatus but 
should still be observable. The second term in (11.90) gives the scattering from the particles 
not in the condensate. This term is left as an integral. However, 7, may be obtained from data 
with small numerical scatter by taking the following derivative (œ # £ọ): 


m? 


0 
wQ. A) ~ mno 


[(@ — EQ Nk (o) + (@ + EQ )Nx(-o)| 


This equation provides a direct method of measuring 7,. As pointed out by Puff and Tenn, the 
Gaussian distribution in (11.82) fits the scattering data well. For this distribution, the integral 
can be evaluated analytically 


S(Q, «) = 2nfy[8(@ — £9) — 8(w + £0)] (11.91) 
+ (1 — fy) ; [e~(@-£9)" /Aeqw _ e (Oreo) /4eqw) (1 1 .92) 
fQw 


Equation (11.91) predicts that the spectral function S(Q, œ) is the difference of two Gaus- 
sians, peaked at œ = +eọ and with a width given by 2, /e€gw. There is also the delta function 
at œ = +&, from the scattering from the condensate. The results appear as a single Gaussian 
whenever £o >, which is the usual experimental case. 

There have been numerous measurements of ną, including those of Harling (1971), 
Rodriquez et al., (1974), Svensson et al., (1981), and Svensson and Sears (1986). The data of 
Woods and Sears (1977) is shown in Fig. 10.5. The solid line in Fig. 11.5(a) is a fit to their 
points taken at T = 4.2K when the liquid *He is not superfluid. This line is a remarkably 
good fit to a Gaussian. The other set of points are taken at T = 1.1 K in the superfluid state. 
There is an obvious increase in the distribution m, at small values of k, as shown by the 
difference spectra in Fig. 11.5(b). The increase at k ~ 0 has the width of their resolution 
function and is interpreted as being caused by the condensate fraction f. They estimate 
Jo = 6.9% + 0.8% from this data. If this value of f} is extrapolated to zero temperature by 
using the Bose-Einstein formula, one finds that A(T = 0) = 10.8% £1.3%. The value 
Jo = 9.11 is in good agreement with the theoretical estimates, with the earlier results of 
Harling, and with the analysis of Puff and Tenn. However, there is little justification for using 
the Bose-Einstein formula for extrapolating f(T) to zero temperature except expediency, 
since no other formula is available. It also appears from the data of Woods and Sears that the 
values of ną in the superfluid state are not as good a fit to a Gaussian as they were in the 
normal fluid. This feature is not understood. 

Equation (11.86) for the sudden approximation is now derived. The first step begins 
with the definition of the spectral function S(Q, œ). As in the case of the electron gas in 
Chapter 5, the spectral function is derived from the density—density correlation function. If 
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FIGURE 11.5 (a) Neutron scattering results for the momentum distribution in liquid “He at T = 1.1 and 4.2 K asa 
function of p? (b) The difference between the two distributions in part (a) shows a peak at small wave vector whose 
width equals the experimental resolution. The peak is interpreted as the condensate fraction. Source: Woods and Sears 
(1977) (used with permission). 


700 Chap. 11 e Superfluids 


this correlation function is called %(Q, i,,) in the Matsubara representation, then S(Q, œ) is 
—2 times the imaginary part of its retarded function: 


Lf, | 
1(Q, io,) = — ral dre" (T.p(Q, 1)p(—Q, 0)) (11.93) 
0 
S(Q, o) = —2 Im[¥,(Q, w)| (1 1.94) 


According to the result of Problem 16 in Chapter 3, S(Q, œ) can written as 


SQ o) =F e| dre (PQ, DQ, 0) (11.95) 


The experiments are done with conditions of temperature and energy transfer hœ such that 
Bw >> 1, and the second term in the parentheses exp(—Bw) can be neglected. The space 
representation is used for the density operator p(Q), which brings us to the expression 


p(Q) = 9 e2 (11.96) 
J 
S(Q, ©) = 1 >| dte’™ (eQ BRO e-iQ RO) (11.97) 
N FJ- 


The correlation function is evaluated by determining how the particle motion R,(t) proceeds 
in time. The two exponents Q:R; and Q:R,(t) cannot be combined, since they do not 
commute at different times. This point will be shown later. 

The sudden approximation is obtained by solving the correlation function for small 
values of time ¢ by expanding R,(t) about point t = 0. The physical argument is that the 
scattering by energetic neutrons happens rapidly, so that only the short time response of the 
systems is applicable. Near t = 0, the coefficients in the Taylor series are found from the 
commutation relations: 


RO-RO +E 4.2 (E O(P) (11.98) 
jD = YDF) taa) t 
AR RI 11.99) 
a ER “ 
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The next step is to separate the several terms in the exponent into the product of several 
exponents. The Feynman theorem is used: exp(A + B) = exp(4)exp(B)exp(C) with 
B = iQ: R;, A = i(Q/m) - (tP, + t F;/2), and C = —[A, B]/2 = —iteọ. The next step is to 
separate the factor iQ - P,/m from it*Q + F,/(2m). They do not commute, but such corrections 
terms contribute on the order O(¢*), which are being neglected. The short time response is 
written as 


S(Q, ©) = 1 5 | * dtel(-e0) (eQ: P;/m gil? /2m)Q-F; iQ- R; eQ Ri 
N F J—co 

The sudden approximation is achieved by making two important approximations to this 

expression. The first is to take only the term in the series with i = j. This choice is a valid 

approximation when Q is large, since then the terms with i Æj average to a small contri- 

bution. The second approximation is to neglect the term it7Q- F,/2m. The quantity F, is the 
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force on the particle at R;, which must average to zero. If this force is evaluated by a cumulant 
expansion, for example, the average (F, -Q) is zero, although the average ((F;- Q)*) is not. 
This force contributes a term in the time expansion on the order of O(t*) rather than O(?*). It 
is neglected. 

In the sudden approximation of high Q and high œ, the remaining expression to evaluate 
is: 


1 0° 
S(Q, 0) =F l dte" ©—0) (eQ P;/m) 
N j J-oo 
(11.101) 
(e"Q: P;/m — Jar, woe A Ry V5(Ry a. Ry ete Pi (R; o. Ry) 


The quantity exp(itQ : P,/m) operates on the wave function to the right and displaces the 
position variable R, by the increment 1Q/m: 


e”Q`P;/mp (Ri), R,, wey Ry) = Y(R], R,, e... R, + tQ/m, wey Ry) 


The next step is to do all the position integrals except dR. The multiple integrals just 
produce the off-diagonal density matrix p(R,, R; + tQ/m). Since the off-diagonal density 
matrix is just a function of the difference of its arguments, the expression gives 
p(R,, R; + 1Q/m) = R¢Q/m)/v: 


S(Q, œ) = >» | dte” ©- R(tQ/m) (11.102) 
j Jœ 
S(Q, @) = [ dte" ©- R(tQ/m) (11.103) 


The summation over j gives N, since all terms in the summation are alike. Earlier it was 
observed that R(r) was approximately a Gaussian function of r. In this case, the integrand of 
the time integral is also a Gaussian. A Gaussian is found since the exponent is evaluated to 
order O(?*). 

To derive to the standard formula for the sudden approximation, remember that ny is the 
Fourier transform of R(r): 


3 
R(r) = loos n,e ™" (11.104) 
3 0° . 
S(Q, œ) = sha m| dt ep| (o — Eg — 2-1) | (11.105) 
_ 2m dk Q:k 


which is the first term in (11.86). The second term in (11.86) comes from realizing that 
S(Q, œ) is antisymmetric in o. 

Two important assumptions are made in the derivation of the sudden approximation. The 
first is that Q is large, and the second is that œ is large so that the Fourier transform involves 
only small values of t. The experiments are done such that both conditions are well met. Then 
the approximation is accurate and the experiments determine ng. 

The sudden approximation has the appearance of treating the particles in the liquid as 
being free. The same result could be derived by assuming there was a gas of free particles 
with a distribution ną. When the neutron strikes a particle of momentum k and kinetic energy 
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€, = k*/2m, the particle is changed to wave vector k+Q and kinetic energy €x iq = 
(k + Q)*/2m. Energy conservation demands that the energy transfer be œ + & = &49. The 
probability of having a momentum—energy transfer of (Q, œ) is proportional to 


3 . 
—| ae 5(w- 29-2) (11.107) 


Po J (2x)? * m 


This equation is exactly the form of the sudden approximation. What happened to the 
potential energy? It is not really ignored. In the sudden approximation the essential physics is 
that when the neutron strikes the *He particle, its potential energy does not change imme- 
diately. The impulse will alter the particle velocity, and after a time duration the subsequent 
motion will alter its potential energy. However, by that time the neutron is gone and does not 
record this change in potential energy. The potential energy depends only on the position of 
the particles, and when the neutron strikes one particle and redirects its motion, the potential 
energy only changes later. The fast-moving neutron does not remain around to record this 
alteration in potential energy, so that the change in energy appears to involve only the kinetic 
energy component. 


11.1.6. Bijl-Feynman theory 


The excitation spectrum of *He is shown in Fig. 11.6 as determined by neutron scat- 
tering by Cowley and Woods (1971). These points are found as maxima in S(Q, @). For some 
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FIGURE 11.6 Neutron scattering results for the excitation spectrum of liquid *He at T = 1.1 K. The lower solid line 
is the one-phonon curve, while the dashed line is for the noninteracting particle. A broad second peak becomes the 
single-particle curve at large Q and the two-roton bound state at small Q. Source: Cowley and Woods (1971) (used 
with permission). 
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values of Q there are two values of œ, which indicates that S(Q, œ) has two peaks. The dashed 
line shows the free-particle spectrum £, = k? /2m. The excitation spectrum approaches this at 
large values of k, as shown by Harling (1971). At small values of k, the lower branch of the 
dispersion curve becomes linear in k. Small wave vector is the phonon region of sound waves, 
and the slope is the sound velocity, 240 m/s. The lower dispersion curve has a second 
minimum which 1s called the roton. This point has a high density of states and so has a large 
weight when averaging over the density of excitation states. Indeed, the necessity of this 
minimum was deduced by Landau (1941, 1947) on the basis of thermodynamic data. The 
roton parameters are ky = 1.91 A~' and A=8.68 K. There is a two-roton bound state at 2A 
which forms the lower limit of the upper dispersion curve (Ruvalds and Zawadowski, 1970) 
Zawadowski et al. (1972). The temperature dependence of the excitations was measured by 
Stirling and Glyde (1990). 

This experimental curve forms the basis for the low-temperature and superfluid prop- 
erties of *He. It explains all macroscopic behavior, based on the Landau two-fluid model. 
Once the excitation spectrum is derived, superfluidity can be explained. Deriving the exci- 
tation spectrum is the objective of this section. 

The successful method of finding the excitation spectrum of liquid *He was popularized 
by Feynman (1954). He suggested that the excitation spectrum is given by the formula 
€,/S(k), where S(x) is the liquid structure factor. This formula is qualitatively right but not 
quantitatively accurate. However, its importance is that it led to the formalism which does 
produce the right result. The same formula ¢,/S(k) was obtained much earlier by Bijl (1940), 
whose suggestion attracted little notice since neither S(k) nor œ, were known in those early 
days. The current usage is to call this expression the Bijl-Feynman formula: 


a(k) = 5 (11.108) 


This result is called @o(k) and is derived next. 

The goal is to construct a wave function ‘P,(r,,1r,...,¥y) which describes an exci- 
tation in the system with a wave vector k. For the boson system of liquid “He, this excitation 
wave function must still have the property of being symmetric under the interchange of any 
two-particle coordinates. For long-wavelength excitations, the wavefunction will still have the 
same kind of short-range correlations which were found for the ground state wave function. It 
seems reasonable to assume that the excited state wave function is only a slight perturbation 
upon the ground state wave function. A suitable wave function is obtained by writing the 
excited state as the product of the ground state wave function and another symmetric function 
Ax(r1, ..., ry) of all the positions: 


P(r, see rv) = A, (1), reer ry) ¥o(r1, rary rv) (11.109) 


The idea is simple enough. The stipulation on A,(r,,...,1y) is that it is symmetric in the 
coordinates and contains the wave vector k. Two possible choices are 


AM (r,, ..., ry) =k Deen = L,p(k) (11.110) 


AP este) = expt En) (11.111) 
i 
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Perhaps the reader can think of others. Each choice of A, corresponds to some kind of excited 
state of the system. What is the physics behind each one? It is helpful to use operators such as 
the total momentum of the system: 


P="yy, (11.112) 
J 


For example, the total momentum on Y® = AO, gives two terms, since P can act upon 
either AC , or Yo. However, the average momentum in the ground state must be zero, so this 
can be evaluated to give 


(PO Ppp) = Jer Pry PAP PAY + AC PY] 
=> hk | ar Pry PAC Aw, = Nik 
J 
since A?* A?) = 1. The total momentum of a system of N particles is just Nk, which 
corresponds to the uniform flow of the entire fluid with velocity k/m. That is the correct 


interpretation of the excitation spectrum represented by the choice P? = ACP. Another 
check on this interpretation is to find the energy by evaluating the Hamiltonian: 


E = k (PAPO) (11.113) 
— -E ) Jer Pry PAC” [WoW AP + 2V,A + Vo + AVY | 

tay] PrP rio? ELC) (11.114) 

B=4.+72 (11.115) 


The average energy per particle is ¢, = k? /2m plus the ground state value E,/N. The cross 
term 2V, AQ ” VF, in the preceding expression averages to zero, since it contains the average 
momentum | PVY, in the ground state of the stationary liquid. 

The other excited state wave function po = APP, is the important one for the 
excitation spectrum of the stationary liquid. The superscript (1) will be dropped, and it will be 
called just Yg = A, Yo = Lkp(k)Yo. This term will be investigated systematically. The first 
step is to find the normalization constant L, for the wave function, which is done by setting 
the normalization integral to unity: 


1 = (YI) = 5 Jer -- -Pryl Yol? explik « (r; — r;)] (11.116) 
jl 

The terms in the double summation with / = j just give unity since the wave vector depen- 

dence drops out. Special care must be taken whenever / Æ j. Then the space integrals can each 


be evaluated, at least formally, except those with r, and r;. These integrals over d?r; --- dry 
just produce the pair distribution function p,(r,, r;) = v-?g(r,; — r): 


l . 
| = L z 1+5 2 | end nat — peremo | (11.117) 
ji jA 
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The summation for each particle / is the same, since all particles are identical. This 
summation is eliminated and instead multiply by N: 


1 | 
l= a + Ya | ond nae, — meen) (11.118) 
j 


Each term in the summation over j gives the same result, so just do one term and multiply by 
N — 1 ~ N. Also, change integration variables to relative coordinates d’r,d°r, = vd?r, so the 
integral is over g(r). In fact, this expression is precisely the definition of S(k) which is given 
earlier in Sec. 1.6: 


l= wia(1 +p [rne] = NS(k)L? (11.119) 
1 


a = Teas 


Equation (11.120) provides the rigorous evaluation of the normalization constant Lg. It is 
useful to call the structure factor S(k) although it is only a function of |k| = k in the liquid. 

Now that the wave function is normalized properly, it is possible to investigate its 
properties by taking the expectation of different operators. The first is the momentum operator 
P given in (11.112): 


(11.120) 


(P) = (PIPI Y) (11.121) 


First examine a single gradiant of position V, acting upon the wave function P}. Since the 
function A, is a summation over different coordinates, the one term with 7 = j will be 
different from the others: 


VPT -srj ry) = Ly 2 Ve Po) (11.122) 

= Lk VE TP) +5 enuy Po 

Aj 
= ikL,e™ "Py + AV; Po (11.123) 
The momentum has a simple result 

PY, = kY, + A, PY, (11.124) 
(P) = Ak + (VY, |A, PY) (11.125) 
= hk + (Y, |P|F,) = rk (11.126) 


The first term gives fk. The second term averages to zero because the operators ATA, 
average to unity, and then (Y, |P|F,) is the momentum in the ground state, which is zero. The 
momentum in state Y, is ik. The momentum operator commutes with the original Hamil- 
tonian, so that states can be described by the quantum number of momentum /ik. 

The next step is to calculate the energy of this excitation whose wave vector is k. The 
energy is found by taking the expectation value of the Hamiltonian operator between these 
states: 


Ey = (PIA P,) = Ly (Yop(—k)H p(k) ¥o) (11.127) 
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It is desirable to commute the Hamiltonian operator to the nght, so that it operates on the 

ground state Yo. The ground state is an exact eigenstate of the Hamiltonian HW) = Ey) and 
is the one with the lowest eigenvalue Ep. 

H p(k) = Hp(k) — p(k)H + p(kK)H = [H, p(k)] + p(k)H (11.128) 

Ey = Ly(Pop(-KLH, p(k)%o) + Li(¥op(—k)p(K)H¥o) (11.129) 

The second term on the right gives Eyl? (Fo p(—k)p(k)¥o) = Ep. The first term on the right 


contains the commutator of H with the density operator p(k). Only the kinetic energy term 
fails to commute with p(k) so 


h’ | 
H, p(k)] = -— SIV), e™ 5 11.130 
[H, p(k)] Im% en] ( ) 

Ar 
= 3m ek tik? — 2ik- Vi] (11.131) 
(‘Po p(—-K)LH, p(k)]'¥o) = £: (Pop(—k)p(k) Fo) (11.132) 
2 

— 1E Dp ke nk -V Yo) (11.133) 


J 


The second term on the right must be treated carefully. Since the ground state wave function is 
real, we can replace Po V;'¥q by VE /2 and then integrate by parts, so that this term is 
transformed to 


(Pop(-K)[H, p(kK)|¥o) 
2 


h 
= &(Pop(—k)p(k)'Po) + 15 2 (Puk V,lp(—k)e™""]) (11.134) 
J 


ih? ik- (r-r 
= Sfora Eivy |e tr ') (11.135) 


The gradient operator in the last term will produce a factor ik in each term except the one 
where / = j. The two terms will cancel term by term except the one where / = j, which exists 
only in the first term. The result is 


(Yo p(-K)IH, p(k) Po) = &; 21 = Ne, (11.136) 
J 


Since L? = 1/NS(k), the first term in (11.129) for E, is ¢,/S(k), and the second is Ey. The 
energy in the system for the excitation k 1s 


E, = (Y;,|H|¥,) = Ey + olk) (11.137) 
a(k) = ab (11.138) 


The excitation energy is defined as (Kk), since it is the energy above the ground state Eg. The 
result is the Bijl-Feynman excitation energy. 

The solid line in Fig. 11.7 shows @ (kK) and is compared with the actual excitation 
spectrum of liquid He. The two curves have exactly the same shape, but the Bijl-Feynman 
spectrum is about a factor of two larger. The roton minimum in the Bijl-Feynman dispersion 
curve is caused by the peak in S(k) at k ~ 2A. At small values of k, @)(k) goes to zero 
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FIGURE 11.7 The excitation spectrum of liquid *He. The solid line is the BijlI-Feynman theory, compared with the 
experimental data as the points. 


linearly with k, and the slope is the speed of sound. In the Bijl-Feynman theory, this happens 
because £, vanishes as Å? while S(k) vanishes as k, so their ratio does vanish as k. One cannot 
use experimental values of S(k) in this region, since the experimental values of S(k) have an 
additional contribution from thermal fluctuations which provides a small constant to S(k) in 
the limit where k — 0. This constant, which is proportional to kgT times the compressibility, 
is easily subtracted from experimental S(k) to obtain the S(k) one desires to use in the Bijl- 
Feynman formula. 

The agreement between theory ®)(k) and experiment (k) is not satisfactory. The theory 
must press onward, and one must look for additional interactions or mechanisms which will 
improve the comparison. The interactions between these excitations must be included. Next 
consider a system with multiple excitations, where each is obtained by multiplying the ground 
state by the factor A, = L,p(k): 


Y, = Ak Yo = Ik) 
Fk, k, = Ak, Ax, Yo = ik}, k,) (11.139) 
Yy, pas k, — Ax, “Ay Fo = |k;,...,k,) 
The A, are scalar functions, so it does not matter in which order they are arranged. These 


states are examined carefully in the next section to see whether the different excitations are 
independent or whether they interact. 


11.1.7. Improved Excitation Spectra 


The method of Feenberg (1969) will be described, which is based upon the states in 
(11.139) which contain multiple excitations. They are examined to test their orthogonality and 
their interaction. The state of one excitation |k) has an interaction with the state of two 
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excitations |k’, k — k’). They contribute self-energy effects for the state of one interaction. 
These self-energy effects are just what is necessary to give a good spectrum. The final result 
will be w(k) = ®o(k) + 2(k), where X(k) is the self-energy of the excitation k. The method 
of calculating this self-energy is only outlined; a full description is in Feenberg’s book. 

The reader should be aware that the present procedure is dramatically different from any 
used elsewhere in this book. All the other problems have been based on the interaction 
representation, where the Hamiltonian is written as H = H) + V, where V is a well defined 
perturbation. The present approach does not do this at all. Instead, a set of states such as 
(11.139) is written down as an ansatz. These states cannot be readily identified with any part 
of the Hamiltonian and are not orthogonal eigenstates of any operator which is obvious. Nor 
is it straightforward to do perturbation theory, since there is no V in which to expand. These 
problems can all be overcome, but the reader should be alerted that the present derivation is 
quite different from previous ones. 

Feenberg and his associates established the following properties for these excitation 
States: 


(kp|kp) = 1, kÆp (11.140) 
(kk|kk) = 2 (11.141) 
(kp|H|kp) = Ep + œ(k) + @(p) (11.142) 


These properties are exactly those for operators A, which are were treated as boson creation 
operators at. For example, the state with two excitations in k is 


Ik) = a}.|0) = |1,) (11.143) 
Ikk) = atat |0) = al |l) = V2|2,) (11.144) 


where the ./2 comes from the usual raising operators for bosons. This factor explains the 
additional factor of 2 in (11.141). The state |k,k,) has an excitation energy given by 
Wo(k,) + @o(K,) and seems to be composed of states with independent boson excitations. 

However, this free-boson analogy should not be pushed too literally. For example, states 
with different numbers of bosons but with the same total momentum are orthogonal only to 
order O(1/./N). For example, 


1/2 
(k,, k,|k, +k) = eee (11.145) 
1 
(k,, k,|H|k, + k,) = (=) (11.146) 


The last two matrix elements are important for calculating the self-energy of the excitations. 
There exists a matrix element for one excitation of wave vector k going to two states with k’ 
and k — k’. The self-energy diagram is shown in Fig. 11.8. The excitations are represented by 
wavy lines, and the one excitation has a self-energy from making two excitations. The same 
Feynman diagram is calculated for the self-energy of a phonon from anharmonic interactions. 

The matrix elements in (11.146) show that the excitations are not truly independent. 
Anharmonic processes contribute to the self-energy. It is derived in the following fashion. 
Since there is no obvious interaction V in the present problem, instead use H — E. The basic 
matric element is (p, k — p|(H — E)|k) which uses both results in (11.146). From second 
order Brillouin—Wigner perturbation theory, the self-energy is 


E(k, £) = 1y PE = pI = DIK) 


p E — Ey — o(p) — p(k — p) (11.147) 
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A self-consistent equation is derived for œ(k) = @ (kK) + X(k). Note that E appears in 
the matrix element and the energy denominator. Feenberg and his collaborators took the 
quantity (H — E) ~ (H — Ey — @o(k)) in the matrix element, and found only an approximate 
eigenvalue from their results. The details of calculating the matrix element of H between the 
excitation states is tedious, since it involves an evaluation of three-particle correlation 
functions for the liquid. Their numerical results are shown in Fig. 11.9. as the dashed line. 
They compare quite favorably with the experimental spectrum as found by neutron scattering. 
The result is a dramatic improvement over the Bijl-Feynman curve, which is marked as B—F. 
Probably the first calculation of this kind was done by Feynman and Cohen (1956, 1957), who 
did a variational calculation of the backflow around the excitation. That is exactly the same 
physics as the admixing of the double excitation states, so the two methods of calculation are 
really the same but dressed in a slightly different language. 

The theory of liquid *He is remarkably successful. Starting from the potential function 
V(r) between atoms and other basic numbers such as the particle mass and density, the theory 
has calculated most of the features of the liquid with great success. By using correlated basis 
functions, the ground state properties such as g(r), S(k), ng, condensate fraction fọ, and off- 
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FIGURE 11.9 The excitation spectrum of liquid *He. The points are the experimental result for the superfluid. The 
other lines are various theories, including Bijl-Feynman (B-F) and backflow theories which use the Kirkwood 
superposition approximation (KSA) or convolution approximation (CA). Solid points are the Feynman—Cohen theory. 
Source: Feenberg (1969) (used with permission). 
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diagonal density matrix R(r) are obtained. Most of these quantities, or their Fourier trans- 
forms, can be compared favorably with experimental results. Similarly, the excitation spec- 
trum can also be calculated. After some work, a decent curve for œ(k) is now found which 
compares well with the experimental results. All of the features of liquid *He have been 
derived starting from a microscopic theory. The excitation spectrum w(k) is all that is required 
to find the macroscopic properties, such as the superfluid density in the two-fluid model, and 
the superfluid flow properties. The macroscopic theory of superfluidity in *He can be derived, 
step by step, from a knowledge of only V(r) plus some fundamental constants and masses. 
This achievement is remarkable for such a strongly interacting and highly correlated system. 


11.1.8. Superfluidity 


This section explains why the liquid is superfluid, and shows how to compute its 
superfluid properties. The explanation was first provided by Landau (1941, 1947), whose 
derivation is given below. 

For temperatures below T, in the superfluid state, the fluid density can be viewed as 
consisting of a normal py and superfluid pẹ component. The sum of these two is the normal 
density p = py + ps. The two components py and ps vary considerably with temperature 
where p.(7,,) = 0 and p,(T = 0) = 0. The actual variation with temperature of p,/p is 
shown in Fig. 11.10. In this two-fluid model, each component of the fluid, normal and 
superfluid, carries its own momentum, energy, etc. The normal component is given by the 
following expression for T < T, 


ht dq d 
Py = intone’ l- grea] (11.148) 


Py /p (per cent) 


1.2 16 20 Apt. 
Temperature (°K) 


FIGURE 11.10 The density of the normal fluid py as a function of temperature: o, Derived from oscillating disk 
experiments; e, from the velocity of second sound. Source: Lifshits and Andronikashivile (1959). 
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which is explained later. The superfluid component is just pẹ = p — py and has nothing at all 
to do with the condensate fraction fọ. If one takes the measured excitation spectrum w(q) and 
uses it to theoretically calculate p(T), one obtains the solid line in Fig. 11.10. This calcu- 
lation is in remarkably good agreement with a number of ways to measure py(T) such as the 
viscous drag on rotating plates, etc. Indeed, Landau was able to deduce a number of properties 
of @(q) before it was measured directly from neutron scattering by fitting Eq. (11.148) to the 
measured component of normal density. In particular, the curves for py(T) show an activation 
energy, which is now associated with the roton minimum A. Landau was able to deduce the 
necessity for this roton minimum, as well as determine the speed of sound, and predicted 
quite well the entire shape of the excitation spectrum, including phonon and roton parts. His 
was a remarkable theoretical achievement. 

The Landau argument may be understood by considering a Gedanken experiment, which 
has the liquid helium flowing down a pipe with a uniform velocity v. This velocity is small, 
say 1 cm/s, which is much smaller than the velocities in the zero-point motion of the fluid. 
The total momentum of the particle flow is calculated, and it is not just Nmv, where N is the 
number of particles. 

First construct a wave function which describes the uniformly moving liquid, which uses 
the operator A? for uniform flow in (11.110). Label this wave function as ¥\(r,,1r,..., ry) 
to denote the state of uniform flow. The energy and momentum of this wave function were 
calculated earlier: 


Wir). r2, ..., ry) = exp in . a) oer “++ Ty) (11.149) 
J 

(PY |P|Y,) = mNv (11.150) 
y2 

(Y,|H|,) = E, tND- (11.151) 


The next step is to consider the excitation spectrum of this moving fluid. The multiplication of 
Y, by A, = Lkp(k) produces an excitation with momentum fk and excitation energy 
p(k) + k» v, as shown by 


($ (k)iPIp,(k)) = mNv + ñk (11.153) 
mv" 


(O (KIH Ip, (k)) = Eo +N + Oo(k) + k-v (11.154) 


2 


The result wo(k) + k + v is quite reasonable: If @ (Kk) is the excitation spectrum with respect to 
the frame moving with the fluid, then the Doppler-shifted energy, as viewed by an observer at 
rest in the laboratory, is @)(k) + k-v. 

Of course, the excitation spectrum should be w(k) + k- v rather than @)(k) + k ° v. This 
improvement can be achieved by including the self-energy in (11.147) which produces the 
improved excitation spectra. This step is done and the excitations are indeed the ones which 
are physically realistic. 

The momentum operator gives mNv + hk for a single excitation of energy w(k) + k ° v. 
In the actual system, there will be a number of excitations whose distribution is given by the 
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thermal occupation factor ng for boson excitations. The average value of the momentum 
operator is 


3 
(Y PY). = Nmv + ~| d k ħknglo(k) + kv] (11.155) 
P J (27) 

The argument of the boson distribution factor ng(Z) = 1/[exp(BZ) — 1] is the excitation 
energy w(k) + k- v in the laboratory frame. The chemical potential is zero in the superfluid 
state, since it is a form of Bose—Einstein condensation. Note that this expression is not the 
average momentum in the ground state of flowing fluid, since that would be found from the 
average of m f d°k/ (20) Akn my = Nmv, where n_, is the momentum distribution of the 
particles in the ground state of the superfluid. The total momentum in the ground state of the 
flowing fluid is just Nmv. Instead, (11.155) is the total momentum in the flowing fluid as 
contributed by the ground state plus the excitations. The excitations are produced by the 
thermal fluctuations, so that the total momentum will be temperature dependent. 

For small velocities the argument of the distribution function can be expanded in powers 
of k: v: n3(@(k) + k: v) © ngo) + Bk wn,(@) + O(v’). The first term with just ng[@(k)] 
gives a zero average of #k, and the important result is from the second term in the series 
expansion: 


Ak(k + v)n',[o(k)] (11.156) 


3 
(P) ay = Nmv + | d id 

P J (27) 
Since the spectrum n; is isotropic in k, the angular integrals are done easily and give a vector 
in the direction v in f dQ,k(k +v) = 4nvk*/3. Collect together the remaining factors shows 
that the total momentum is expressed in terms of the normal fraction of superfluid which was 
defined in (11.148): 


(P) = Nmv(1 -2x) =“ Nmy (11.157) 


The final result is that the total momentum in the flowing fluid is given only by the fraction 
Ps/p which is superfluid. The normal fraction py/p gives no contribution to the momentum. 

In a normal liquid, uniform flow is not an eigenstate because viscosity damps the flow. 
The superfluid has no viscosity, and uniform flow is an eigenstate. At nonzero temperatures 
0<T<T,, only the superfluid component p,/p can flow without damping and is an 
eigenstate with a well defined momentum. 

The physical picture of this result is very subtle. The flow of liquid is down a pipe, which 
remains in the laboratory frame. The normal component of the fluid is pinned by the pipe and 
does not participate in the flow. There are two important points to this derivation. The 
reference point for the excitation spectra is the rest frame of the superfluid. The excitations are 
made from this flowing fluid and have an energy w(k) with respect to the rest frame of the 
superfluid. However, the influence of the excitations was calculated in the rest frame of the 
walls of the container, in this case the pipe. The coordinates were shifted to the laboratory 
frame and which gave the Doppler-sh'fted energy œ(k) + k.v before doing the thermal 
averaging. The rest frame of the container and the superfluid both enter the calculation. The 
result is that only the superfluid density carries momentum when the walls are stationary. 

The flowing superfluid is in an eigenstate of the system, which can persist without 
damping. The state Y, has an eigenvalues (p,/p)Nmv of the momentum operator. In a 
completely normal fluid, such as neon or tHe for T > T,, uniform flow down a fixed pipe is 
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not an eigenstate, and the flow is dissipated by viscosity. In the superfluid, the component py, 
is not dissipated. 

This calculation should be compared with other similar systems. What if the entire 
container of superfluid is just picked up and moved at a constant velocity; i.e., what if a 
Dewar is driven on a truck? Then the walls and the superfluid are moving at the same velocity, 
and the thermal averaging should be done in their mutual rest frame, which is moving with the 
truck. Then the momentum average gives 


Nf dk 
(P) oy = Na+ | E hkn,[o(k)] = Nmv (11.158) 


Now the result is Nmv since both normal and superfluid components are moving with the 
same velocity. 


11.2. LIQUID *He 


It is certainly a misnomer to call liquid >He a “normal liquid.” The title is used here to 
distinguish it from the superfluid phase of the liquid. This system is interesting because its 
properties are so exceptional. It is really the only ideal Fermi liquid in earthbound nature, 
since electrons in metals are not ideal because their properties are perturbed by the crystal 
lattice of ions. The *He atoms are charge neutral, so that the properties of the Fermi system 
are much different from electrons in metals. They are also strongly interacting and highly 
correlated, so the noninteracting Fermi gas is not a good starting point for a description of its 
ground state. The discovery that *He has a superfluid phase, which is akin to the BCS 
superconductor, has heightened interest in this liquid. 

The theory of these systems operates at several levels. One example is the two-fluid 
model of hydrodynamic flow, and another example is the microscopic derivation of S(k) and 
g(r). The theoretical description can be viewed as a sequence of plateaux, where each plateau 
is a self-consistent description of the system. For example, the excitation spectrum of liquid 
*He is such a plateau. One feature of these plateaux is that they can be compared directly with 
experiment. The theoretical effort generally is devoted to deducing the theory of one plateau 
by using the model of the lower one. For example, in liquid *He, one does not try to derive in 
one step the hydrodynamic flow properties from the microscopic theory. Instead, one uses 
microscopic theory to find the excitation spectrum w(x), which is one plateau. Then one uses 
Landau theory to derive the two-fluid model in terms of p,(T) and p(T), which are described 
in terms of w(k). The theory is a sequence of plateaux, each of which can be compared with 
experiment, and one uses the theory to try to relate one step to the next one higher in the 
sequence. 

The same sequence applies to the properties of liquid *He. The first, and most important 
plateau, is the Fermi liquid theory which was developed by Landau (1946, 1956, 1957), who 
has provided the main insight into both *He and *He. This theory was developed as a 
phenomenological model for the behavior of low-lying excitations of the strongly interacting 
Fermi system. These excitations can be described by a few parameters, which can be deduced 
from experiments. A major goal of microscopic theory is to derive these parameters from first 
principles. The major effort along this line, as was also the case for *He, is by Feenberg and 
his associates. So far they have achieved a good level of agreement, which is on the order of 
30% at the worst and is better for some quantities. Since the Landau theory of the Fermi liquid 
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is the major plateau of the theory, this section is devoted to describing it as well as 
summarizing the microscopic theory. 

The next plateau is a description of the superfluid state, which is usually done using the 
parameters of Fermi liquid theory. These parameters play a central role in the theoretical 
development. The superfluid properties are well described in this fashion using the experi- 
mentally measured parameters. In this sense a quantitative theory is available for the super- 
fluid state. 


11.2.1. Fermi Liquid Theory 


There are excellent reviews of Fermi liquid theory as applied to liquid *He in the normal 
phase. Several of these are Pines and Nozieres (1966), Baym and Pethick (1978), Ron (1975), 
and Leggett (1975). These treatments are usually more detailed than the one here, and the 
reader is encouraged to read some of these others. 

The symbol n, is defined in (3.135) as the average number of atoms in the Fermi liquid 
with momentum p. In a noninteracting Fermi system, at zero temperature, this distribution is 
Ny = ©(kr — |p|). Of course, in a strongly interacting Fermi system, this distribution will be 
strongly smeared. Figure 11.11 shows the momentum distribution for *>He measured by 
neutron scattering at T = 0.37 K by Mook (1985). The points are experimental. The solid line 
is a fit to the Fermi distribution n(p) = 1/[exp(&6,/kgTg) + 1] with Tg = 1.8 K. The smearing 
caused by particle—particle interactions can be represented by an effective temperature in the 
Fermi distribution. 

Throughout the remaining part of this chapter, the symbol n, is used to mean something 
entirely different from its usage elsewhere in this book. Whereas before n, always is the 
distribution of states with free-particle momentum p, now it has a different meaning. Fermi 
liquid theory has a well-entrenched set of notation, with a universal set of symbols, whose 
meaning is well agreed upon. These strong conventions are followed here, and now n, is the 
distribution of excitations of momentum p. 

The objective of Fermi liquid theory is to describe the low-lying excited states of the 
fermion system. There is a ground state which the system would be in at zero temperature 
except for superfluid phases and this ground state will be discussed in the next section. In 
analogy with the electron gas, two kinds of low-lying excitations are expected. The first are 
density oscillations, which are plasmons in the charged electron gas. In the neutral Fermi 
liquid they are sound waves. The second type of excitation is particle-hole pairs. There is also 
a third kind of excitation, which is a damped spin wave, called a paramagnon. Fermi liquid 
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FIGURE 11.11 Momentum distribution n, of particles in 3He at T = 0.37 K. The solid line is a least-squares fit to 
the Fermi distribution with an effective temperature of T p= 1.8 K. Source: Mook (1985) (used with permission). 
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theory is primarily aimed at a description of the properties of low-lying particles and holes. 
However, the other excitations—density oscillations and paramagnons—can be viewed as 
collective resonances of these particle and hole excitations. Fermi liquid theory also describes 
them. Indeed, many of the so-called Fermi liquid effects actually come from the collective 
properties. The discussion will follow the historical pathway and first treat the classical Fermi 
liquid theory due to Landau. Then the modern interpretation in terms of collective properties 
will be discussed. 

By using the parameters in Table 11.1, one would calculate a Fermi energy of Ep) = 
4.97 K for a noninteracting gas of the same density and mass of *He. The actual degeneracy 
energy in liquid *He is about two-thirds of this value. Fermi liquid theory is valid only for the 
particle-hole excitations whose energy is a small fraction of the degeneracy energy—say 
5—10%. The excitations in this theory have low energy, say less than 0.1—0.2 K. If these 
excitations are used to describe the thermodynamics of the excited states of the system, it can 
only apply for temperatures below 0.1—0.2 K. Otherwise, the thermodynamic properties will 
involve states beyond the range of the theory. Fermi liquid theory is concerned with very low- 
energy excitations, which are important only in the liquid at very low temperatures. Since the 
superfluid properties begin at a temperature two decades lower than this, there is an appre- 
ciable temperature range over which Fermi liquid theory is applicable. 

In the homogeneous electron gas, the imaginary part of the electron self-energy, from 
electron—electron interactions, vanishes for electrons whose energy is right at the chemical 
potential u. Furthermore, Im È is also small for states near the chemical potential, since it 
vanishes as (€p — u). The low-lying excited states of the Fermi liquid have a long lifetime. 
The same phenomenon is assumed to happen in *He, and general theorems can be proved 
which demonstrate this with rigor (Luttinger and Nozieres, 1962). In cases where Im È ~ 0, 
the spectral function A(p, œ) is sharply peaked and has the character of a delta function. There 
is a unique relationship between energy and momentum in these cases, and the excitation is 
satisfactorily described by only one of these two variables. In Fermi liquid theory this variable 
is conventionally taken to be the momentum p. An excitation of momentum p is assigned an 
energy Ep. 

The energy zero is defined where the excitation energy €, is zero at p = 0. Furthermore, 
if the energy is expanded in a Taylor series about point p = 0, the odd terms in this series 
p””"*+! must vanish because of the homogeneity of the liquid and the isotropy between p and 
—p. The first term in the expansion is p*, and other terms are usually ignored. Write Ep X D’, 
and the constant of proportionality defines the effective mass m*, , = p?/2m*. In liquid *He, 
the effective mass has a value of m* = 2.76m, where m is the bare mass of *He. These and 
other experimental properties are reviewed by Glyde (1994) and by Vollhardt and Wolfe 
(1990). 


TABLE 11.1 Properties of liquid helium (pressure = 1 bar). The parameters 
©, £ are for the Lennard-Jones potential 


Parameter 4He 3He 

m 6.65 x10777 kg 5.01 x107% kg 

p 2.18 x10 atoms/m? 1.64 x10? atoms/m? 
C(T =1K) 238 m/s 182 m/s 

o 2.648 Å 2.648 Å 

E€ 1.484 x107? J= 10.7 K 1.484 x107? J= 10.7 K 


E/N —7.20 K/atom —2.52 K/atom 
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The ground state of the fermion system does not allow all the excitations to be in the 
state with p = 0. The exclusion principle forces many excitations into states of nonzero p, and 
all energy states are occupied at zero temperature for €p <p. Define Ny as the number of 
excitations of momentum p in the ground state, in which case the ground state energy is 


Ey = Eo + ¥ pip (11.159) 


where Fy is the energy when all the quasiparticles are in p = 0, which would be the case if 
they were bosons. Of course, the result (11.159) is actually nonsense, since the approximation 
of treating the excitations as free particles with an effective mass m* and no damping is valid 
only when |€, — u| < kgT and does not apply throughout the degenerate Fermi sea. The 
quantity np is a totally fictitious concept. However, it is never used for any property of np 
except at the chemical potential, where the following form is assumed at nonzero temperature: 


l 
eP(Ep —H) + 1 


This approximation is surely valid only very near the chemical potential B(ep — u) < 1, which 
is the only place it is used. 

The important feature is the excitation of the system above the ground state. When 
excitations are present, the distribution n, of these excitations will be different from the 
ground state distribution Np. The difference between these two distributions is defined as 
Sn, = Np — ny. The important physical idea is that the crucial quantity is neither n, nor n9 but 
rather np, since it gives the number of excitations in the excited state which is important for 
the thermodynamics of the system at low temperature. Neither n, nor Np can be determined 
with great accuracy, but òn, can be determined, which is all that matters. For example, the 


total energy of the system, including the excited states, is 


Ny = np(Ep — U) = (11.160) 


E = Eo + Ð Epp (11.161) 
po 
= Ey + X epn? + Ye, [ny — 19] (11.162) 
po po 
= E, +> e,8n, (11.163) 
po 


The excited state energy is ) €,07). 

The important quantity is not the energy but the free energy F = E — uN, where N is the 
total number of particles. The particle number is defined in terms of the number of particles in 
the ground state No 


Ny = ond (11.164) 
po 
N=} n =} n t+ do omy = No + 5N (11.165) 
po po po 
F = E — WN = E, — UNo + } (€p — won, (11.166) 
po 


There are two key assumptions contained in these equations. The first is that the chemical 
potential u does not change because of the excited states, although it does change with 
temperature. The second assumption is that there is a one-to-one correspondence between 
quasiparticles and the excited states of the noninteracting Fermi gas. Adding a quasiparticle 
also adds a particle and changes N by unity. Of course, usually there are about equal numbers 
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of quasiparticles above €, > 1, which have on, > 0, as there are a depletion of them for €, < p, 
which have òn, <0. The latter excitations are called holes. 

So far the Fermi liquid theory has been equivalent to calling the excitations free particles 
with an effective mass m*. If that were all of it, the theory would be unexceptional and not 
quantitatively accurate. The genius of Landau was in realizing that the expansion for the free 
energy must have another term, which 1s due to the interactions. This contribution is required 
for a self-consistent theory. The formulas derived so far are the terms in an expansion of the 
free energy in the excited distribution 6n,. That is, F(6n,) is a functional of dn, and can be 
expanded formally in a Taylor series in this parameter. The Landau theory assumes that on, is 
small and that an accurate quantitative theory is achieved after a few terms in this expansion. 

The terms derived so far are given in (11.165). The first term is Fy) = E, — uN. The next 
term appears to be first order in 6n,. A closer inspection shows that this term is actually of 
order (Sn). The summation over p will run over all values for which on, is nonzero. 
However, the integrand contains (£p — H), which vanishes at €, = p. If dn, has value up to A 
away from the chemical potential, the kinetic energy integral X „(€p — W)8n, œx A*. This term 
is second order in the quantity 6n,. Landau saw the need to include all terms of order (ön). 
There is another term which describes the interactions between the excited quasiparticles and 
so contains the dependence 6n,06n,,. This interaction is very dependent on the spin of the 
quasiparticles. The free-energy expansion has the form 


F = Fo + 2 (Ep 7 H)õnpo + x > Joop’ p'o’ 'ÒNpo Np o' + O(n)" 
po 


pp’,oo’ 


where spin has been added to various quantities. This dependence on the particle spin is not 
due to the spin-dependent interactions. Indeed, the dipole-dipole forces between the nuclear 
moments are extremely small and may be neglected when discussing most phenomena. The 
spin dependence of the interaction is merely due to particle statistics. The antisymmetrization 
of the many-particle wave function for fermions causes an exchange hole around each 
particle. The exchange hole causes an effective exchange energy, just as it did for the electron 
gas. The exchange energy may be qualitatively understood by a consideration of the mutual 
scattering of two *He atoms. Since the two-particle wave function must be antisymmetric, the 
scattering properties depend on the total spin state of the two spin one-half particles as 
described in Sec. 11.1.1. If they are in an S = 0 state, the orbital cross section contains only 
even angular momentum components; if they have a total S = 1 state, only odd angular 
momentum components enter the cross section. The scattering and interaction depend on the 
relative spins of the two particles. It is a very important effect in liquid °He, since the effective 
interaction fpo,p'o' 1S very spin dependent. 
The spin dependence of the effective interaction is written as 


Spo, p'o' = fon’ +o: O'fop (11.167) 


where © = (0,, G,, Oz) refer to Pauli spin matrices. Equation (11.167) is a very particular 
choice for the form of the interaction. Other possible spin dependencies are ignored, such as 
spin-orbit effects (ao + p)(o’ +p’) or spin-other-orbit effects (© - p’)(o’ +p). The choice of a 
simple o@ +o” term derives from our understanding that the basic spin mechanism is due to 
exchange forces, which can be written this way. 

Fermi liquid theory is really only accurate for describing the interactions of quasi- 
particles on the Fermi surface. In the interaction terms pp’ and fop» the momentum variables 
have the magnitude kp, and the only important variable is the angle 8 between p and p’. The 
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interaction terms are represented by an expansion in Legendre polynomials. It is also 
conventional to normalize the coefficients in this expansion by removing a factor of the 
density of states at the Fermi energy Np = m*k,/17h’: 


S,a l Sas S,a 
Sop’ =y" P,(cos 8) (11.168) 


The coefficients Ff and Ff are sometimes called F, and Z;. They are the fundamental 
parameters in the theory and are dimensionless. They can be deduced from experiments, as 
shown later. Reliable experimental numbers are now available for F}, Fj, Ff and F?. A major 
goal of the microscopic theory is to derive them. 

The term fpo,p'o' describes the interaction between excited quasiparticles. The interac- 
tions cause the energy of an excited quasiparticle to depend on the number 6n,, of other 
excited ones. The quasiparticle energy can be formally derived as a functional derivative of 
the total energy EF: 


1 
E = E, +) E€pônpo + zy 2 Jpo, p'o Mpa npo (11.169) 
po pp’,oo 
oF _ i 
Sôn) = Ep = Ep + y Z pop'o Mp (1 1.170) 


The energy of the excited quasiparticle is called €,. It depends on the number of other excited 
quasiparticles through the interaction term. One result of the interactions is that the total 
excitation energy is no longer equal to )/&,6n,,, since this expression overcounts the 
pairwise interaction term. The chemical potential u is not altered by the density of excited 
quasiparticles as long as this density is very small compared to the density of atoms. 

It is time to reconsider the distribution of excited quasiparticles. The same notation is 
used: i.e., that n, is the total number of excited quasiparticles and that np is the number of 
excited quasiparticles in the ground state with energy €p. Another symbol is needed to denote 
the quasiparticle density in terms of the actual energy variable €,. Call np the equilibrium 
density of excited quasiparticles with energy €,, which is defined in the usual way in terms of 
the Fermi distribution function: 


ñp = Ne(Ep — H) (11.171) 


Siing = Mpg — Ny (11.172) 
Similarly, the quantity 6n,, is the departure of the total number npo from this equilibrium 
concentration. The two quantities npo and 6n,,, are not independent and are related through 
the Fermi liquid parameters. This dependence can be derived by assuming that npo is a small 
quantity and expanding 6n,, in this parameter. The key step is the recognition that the 
difference (€p — €p) is proportional to on, 


Ong = Mpg — np (Ep — H) (11.173) 
= po — Np(Ey — H + Ep — Ep) (11.174) 
_ dnp(€p — H) 
= Nyo — Np(Ep — y) — Ey — e) (11.175) 
p 
dnp 1 


Mpo = ÔNpo — de y fowo ÔN yo (1 1 . 176) 
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The first two terms in (11.175) are just npo. The third term contains the factor (Ep — Ep) 
which is also proportional to 6/,,. It has the factor —dnp (Ep — )/de,, which becomes the 
delta function o(€, — p) at zero temperature. One can show that every term in Eq. (11.176) 
has this factor. Divide it out of the expression. Furthermore, the spin dependence of 5”,,, is 
written as a symmetric or antisymmetric combination 67), = dn) + dn). All these results are 
collected, and the distribution functions are expanded in spherical harmonic functions Y,,,. 
The first two equations below define 6;'", etc., while the third equation is the integral 


equation (11.176): 
_ dn FlEp — 


ony. = Je 2y Yin(9, Pònt, + òng,) (11.177) 
Ep 
d — 
õñ = — rrp Z H) E Fm (O, PRp Bf) (11.178) 
p 
—S,a S,a dp dQ, dn 
ang = Sig +2 |? ook ae r(- ze) YRO, D) SSE Yy, nE (11.179) 


Integrate over angle and collect all terms with the same angular dependence, which provides 
the relationship between the two distributions of excited quasiparticles: 


Fe „a 
= 6n,"" 11.180 
These results are useful later. 

The distribution 67,,, seems more important than 6n,,,. For example, Pines and Nozieres 
show that the particle current operator is given correctly by both of the following expressions: 


J= L Bigg (11.181) 
po 

J= Z bn, (11.182) 
po 


The two current operators (11.181) and (11.182) can be used to derive a relationship between 
m* and m. The current operators are nonzero if the distribution functions 67,,, and 6n,,, have 
a nonzero component of angular momentum / = 1, say the value / = 1, m = 0. Then the two 
current expressions can be evaluated by using the definitions of 67, and 6n,,, in (11.180): 

m*  Önio 


_ s 
m ON‘ 


=1+}Fi (11.183) 


A measurement of m* gives the value of F}. It was previously remarked that m* ~ 2.76m, so 
that Fi ~ 5.3. The Fermi liquid parameters can be large relative to unity. 

Table 11.2 shows some predictions of Fermi liquid theory regarding the effect of the 
quasiparticle interactions upon measurable quantities. The first is the specific heat, which is 
enhanced by the effective mass m*, as it is for the electron gas. The quantity Ff is obtained by 
measuring this quantity. The isothermal compressibility and also the velocity of ordinary 
sound are changed by the ratio of (1 + F5)/(1 + F}/3). For a noninteracting gas of *He 
particles, one would estimate the sound velocity from the formula c = v%./3 to be 95 m/s. 
The experimental value is 182 m/s. The ratio is c/cọ = 1.91 gives the value of F$ = 9.15 
since Fj is known. A recent compilation of these parameters is found in Table 11.3, as given 
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TABLE 11.2 Fermi liquid theory:Nro = mkrn? h? 


Quantity Free particles Fermi liquid theory 
2 * 
. C 
Specific heat Cro = KG TN == 1 +4Fi 
yo m 
4: 1 2 K 1+ Fs 
Compressibility — = 10 A_ Tt 2 
K; Nro K, 1+F%/3 
, v? ? K 
Sound velocity c = o E (<) = — 
mNro 3 Co Ky 
. a 1+ EF? 
Spin susceptibilil = pN X- | 
p p ty Xo Ho FO Xo 1 + Fa 


TABLE 11.3 Fermi liquid parameters: A, = F,/{1 + F,(2é + 1)] 
(from Greywall 1983) 


Parameter Experiment Parameter Experiment 
Fo 9.15 As 0.90 
FY 5.27 A; 1.91 
FG —0.70 AG —2.33 
Fi —0.55 AG —0.67 


by Greywall (1983) who found that m* = 2.76m. The other parameters AF“ shown in Table 
11.3 are explained later. 

The other interesting number is F8, which is the isotropic term in the spin-dependent 
part of the interaction. It is deduced from the spin susceptibility, which is found to be much 
larger than the free-particle value y,). The enhancement of x% over Xo suggests that fF is 
negative and near unity, so that the denominator is small in the theoretical expression for the 
susceptibility. Later it is shown that this has an important consequence for the theory of 
superfluidity. The large positive value of Fj and the negative value of Ff combine to suppress 
the quasiparticle interaction in the s-wave channel and enhance it in the p-wave channel. It is 
this reason that the superfluid phases in *He pair up in the spin triplet state, which has a p- 
wave orbital part. 


11.2.2. Experiments and Microscopic Theories 


The microscopic theories of liquid >He are even more complicated than the theories of 
He, because the ground state is more complicated. The complexity is due to the anti- 
symmetrization of the many-particle wave function. If this ground state wave function is 
defined as ‘P,(x;,...,Xy), where x; = (1;, 0;) are general coordinates for both position and 
spin, then the wave function changes sign under the exchange of any pair of coordinates x; 
and x;: 


WW O, X2, 06 Xj ee Myo e Xp) = WF x1, X2, 02 Kip Xis e+e Xy) (11.184) 


For a noninteracting system of fermions, the particle states are described by single-particle 
orbitals o,(x;), and the many-particle wave function in the Hartree-Fock approximation is a 
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Slater determinant of these orbitals. The determinant has the property that no two particles 
can occupy the same single-particle orbital state, or else the wave function is trivially zero. 

The nature of the ground state wave function is less clear in this strongly interacting and 
highly correlated liquid. The microscopic picture of the liquid has the atoms bouncing around 
inside of a small cage defined by the average position of neighboring atoms. This picture is 
correct for liquid *He and leads to a correlated wave function of the CBF form, which is 
highly successful for describing bosons. It is not nearly so obvious how to construct a ground 
state wave function for highly correlated fermions. There probably should be something 
equivalent to the CBFs but for fermions. The question is how the fermion nature of the 
particles is expressed in real space. When visualizing the atoms bouncing around, how is this 
motion different when they are fermions rather than bosons? How does one construct a CBF 
which has the fundamental antisymmetrization property (11.184)? 

Feenberg and his associates have provided the most successful microscopic theory to 
date, which compares favorably to the available experiments. Their ground state wave 
function for liquid *He has the form of an algebraic product of two familiar wave functions: 
the CBF with the form used for ground state of *He and a Slater determinant of plane-wave 
states with spin: 


Y (x1, XQ, 2065 Xy) = Yo(r,, Po,..., ry )®,, p, 1 Ley Xy) 
. (11.185) 
b 
Yor), r>, ...5 Ty) = Ly exp -5 
i>j Ir; E r;| 
D, p, Œ- Xy) = det Ibp, (1), (2) - | (11.186) 


The CBF was taken to be the form (11.62) of a product of pairwise correlations, with a single 
short-range repulsive term as in (11.77). The value of b changes slightly from the value used 
in tHe, because of the mass difference. The Slater determinant is the same one which would 
be used for a system of noninteracting particles and contains single-particle wave functions 
,(x) which are plane-wave states and spin functions for spin up («) or down (P). The Slater 
determinant provides the necessary antisymmetrization, while the CBF provides the necessary 
short-range correlation in the atomic motion. 

This wave function is used to calculate ground state properties such as the energy 
E, = (g|H|g)/(g\g), S(k), etc. These calculations are extremely complicated and will not be 
described here. Interested readers are referred to Feenberg’s book (1969). All many-particle 
matrix elements are evaluated by a cumulant expansion which must be carried to high order. 
They involve the evaluation of three- and four-particle correlations in the liquid state. Some of 
their results will now be summarized. 

The ground state energy is discussed first. Feenberg and his associates first calculated the 
ground state energy assuming >He was a boson. That is, they evaluated E) = (YH |Y.) 
where Y, is the CBF for liquid *He. Since this wave function is symmetric under coordinate 
exchange, Ey is the energy of the equivalent boson liquid. The value of EF, differs somewhat 
for the different ways of doing the cumulant expansion and estimating three- and four-particle 
correlations but has the typical value of Ey = —2.9 K/atom. A boson liquid with the mass of 
3He is less bound than liquid *He, which reflects the increased zero-point motion of the 
lighter mass. Although the density of liquid *He is lower than liquid He, the average value of 
kinetic energy per particle is nearly the same value of 13.4 K/atom. The average value of 
potential energy is —16.3 K/atom, giving the difference of —2.9 K/atom. Thus the average 
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kinetic energy per atom is still high, and the particles still have short-range fluctuations in 
their position within the cage defined ‘by the neighboring atoms. 

The next calculation is to add the Slater determinant of the single-particle orbitals to the 
ground state wave function and to compute the new ground state energy. It is defined as 
E, = Eo + E, where SE is the change in energy due to the motion of the particles in the 
Fermi sea. For a gas of noninteracting particles, this change would be ôE = 
3E no /5 = 0.6(4.97) = 3.0 K. Feenberg and his associates find the smaller value of òE = 1.7 
K/atom, which is a small number compared to the average kinetic energy in the boson ground 
state, ~14 K/atom. The short-range motions of the particles are more important than the 
kinetic energy of motion in the Fermi sea. The final prediction of the ground state energy per 
atom E, = —1.2 K/atom is only in fair agreement with the experimental value of —2.52 K/ 
atom. However, the least accurate theoretical number is probably E>. There could easily be a 1 
K error in the calculation of this quantity, as there was in the similar calculation for *He, 
where it was less noticeable as the difference between 6 and 7 K/atom. This number is 
difficult to find accurately because of the large cancellation between the kinetic and potential 
energies and their sensitivity on the trial function for the pairwise part of the CBF. The 
agreement between theory and experiment, in liquid He, must be regarded as satisfactory. 

The next ground state property is the liquid structure factor S(x). It can be measured by 
both X-ray and neutron scattering. Figure 11.12 summarizes theory and X-ray data. The 
points are experimental, and the lines are the theories of Massey (1966) and Massey and Woo 
(1967). The agreement is obviously excellent. Since the theories were published as predic- 
tions several years before the experiments, the theoretical success is even greater. There is 
great similarity between S(k) for liquid >He and “He. Both have peaks near k ~ 2.0 ÅT! and 
then rapidly approach unity with small oscillations at large values of k. The S(k) values for 
3He are about 40% higher than for *He in the low k region. The good agreement between 
theory and experiment for S(k) means that similar good agreement exists between their 
Fourier transforms, which are the pair distribution functions g(r). 


ô 
96 
oaa 
S660000° © 


2 
(2p) 
o ACHTER & MEYER 
0.50 ° HALLOCK 
— MASSEY 


--- MASSEY & WOO 


Q5 LO 1.5 20 2.5 30 35 40 
tr 
B(A") 


FIGURE 11.12 Plot of S(k) for liquid >He. The points are the X-ray scattering data of Achter and Meyer (1969) and 
Hallock (1972). The solid curves are the theories of Massey (1966) and Massey and Woo (1967). The agreement is 
obviously excellent. 
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FIGURE 11.13 Zero sound mode energy in liquid *He. The points are from Sköld and Pelizzari (1980). From Glyde 
(1994, used with permission). 


The liquid must have a long-wavelength collective oscillation. In ordinary liquids it is 
just the longitudinal sound wave. In the electron gas it is at the plasma frequency ,, where 
the charge of the electron gas makes the mode have a nonzero frequency even in the limit 
where q — 0. Since liquid 3He is electrically neutral, the long-wavelength density oscillation 
will just be a sound wave, which is called the zero sound mode. Figure 11.13 shows the zero 
sound mode in liquid >He as measured by neutron scattering. The cross-hatched region in the 
lower corner of the figure are the particle-hole pair excitations out of the Fermi sea, as is 
found in metals. They are calculated using m* = 3m. 

There are several different kinds of sound waves which can propagate in these systems. 
The ordinary sound waves at very long wavelengths are called first sound. Their speed 1s 
calculated by including the particle collisions, since such collisions are very frequent 
compared to the sound frequency. This results in a hydrodynamic version of the sound 
propagation. At much higher sound frequencies, the collision rate between particles becomes 
small compared to the sound frequency, and the speed must be calculated in the collisionless 
approximation. This regime is called zero sound and has a higher velocity. 

The third excitation of liquid *He are the spin fluctuations at low energy. These will be 
treated fully in the following section. 


11.2.3. Interaction Between Quasiparticles: Excitations 


Fermi liquid theory is not a complete description of all the dynamical motion of 
quasiparticles in liquid *He. The theory was constructed by Landau to explain thermo- 
dynamic and static measurements such as specific heat and spin susceptibility, and it does that 
very well. In order to explain other measurements, it must be decided whether enough 
information is available in the Fermi liquid parameters or whether additional facts are needed 
about the quasiparticles. The additional measurements are transport experiments such as 
thermal conductivity and viscosity. A theory is also needed for low-energy spin fluctuations 
and also superfluidity. So far all that is known about the quasiparticles are their effective mass 
and the four interaction parameters Fọ'{. It must be decided what these parameters mean in 
terms of the microscopic scattering theory, and what additional information is needed to fully 
describe the interaction between quasiparticles. 
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It is useful to remember the main features of electrons in metals, which is the most 
understood Fermi liquid. The electrons at the Fermi surface have an effective mass m* which 
is comprised of several contributions. Those of the electron—electron interactions, and also the 
band structure, gave a contribution to m*/m which was not very energy dependent. However, 
the contribution from electron-phonon interactions was often very large and varied strongly 
with energy, so that it was negligible a Debye energy away from the Fermi surface. In liquid 
3He, one should ask the question whether the ratio m* /m ~ 3 changes rapidly or slowly as the 
energy of the quasiparticles is varied. Or, to rephrase the question, what is the appropriate 
energy scale which determines the variation of these parameters? Another motivation for this 
question is provided by recalling the BCS equation for the transition temperature of a 
superconductor: 


Fro 


The exponent has the factors NV, which are the density of states and the effective interaction 
between quasiparticles at the Fermi surface. These two parameters can be estimated from the 
parameters of Fermi liquid theory. However, the prefactor in this equation contains the Debye 
energy ip, which is the effective energy range over which the phonon attraction between 
quasiparticles is operative. This effective energy range of the interaction cannot be obtained 
from the parameters of Fermi liquid theory and must be deduced from other considerations. 

In elemental metals, superconductivity is usually caused by the electron-phonon inter- 
action. This theory has been made very quantitative. The accuracy is helped by a number of 
smallness parameters, such as c,/vp, ©p/Epr, and m/M, in which the theoretical expressions 
can be expanded. None of these apply to liquid He. It has c, >vp, and the energy of high 
frequency sound exceeds the Fermi energy of Ep = kgTp © 3 K = 0.26 meV. In fact, no 
obvious smallness parameter comes to mind. Nevertheless, there probably is one since the 
ratio of T /Tp ~ 107° is similar between metals and liquid *He. 

The scattering of two quasiparticles can be described in the following manner. Two 
quasiparticles initially have momenta p, and p,. They interact and exchange momentum q 
and end up in the states p, = p; + q and p, = p, — q. The scattering of the quasiparticles is 
described by a phenomenological matrix element M,,/(p,, P2; P3, P4). The spin indices are 
included, since the effective scattering is spin dependent. The goal is to obtain a quantitative 
description of this matrix element. It is defined to include exchange events, so it describes the 
direct scattering plus the exchange scattering. 

The first step is to find the relationship between this matrix element and the Fermi liquid 
parameters. This identification is found by an intuitive argument. In a weakly interacting 
Fermi gas, the Hartree energy is given by 


Ey = Enr éV a= 0) (11.188) 
p 


The self-energy diagram for Xy has a particle of momentum p scatter from the other particles 
with momentum p’ and density np(&py). No momentum is exchanged, so the self-energy is 
expressed in terms of the potential V(0) at zero-momentum transfer (q = 0), which is 
equivalent to the forward scattering amplitude for the two particles p and p’. The exchange 
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self-energy 2%,(p) just provides the exchange corrections to this forward scattering. Applying 
this theory to calculating the ees of quasiparticles in liquid *He gives 


Ly + dy => D Moso (P, p’; P, Pnyo' (11.189) 


The potential V(0) has been replaced by the forward scattering matrix element 
Moso (P, P’; p, p’), and the distribution of quasiparticles is written as nyo. When an excitation 
is created, the change in this ees function due to the excitation is 


aM so/(P; B's P, p')önyo (11.190) 


This expression is very familiar, since it is exactly the form of the interaction energy in Fermi 
liquid theory, where fic po: = Moo (P, P"; p, p). The Fermi liquid parameters represent the 
forward scattering amplitude between bare quasiparticles. It will be necessary to obtain the 
scattering amplitude for other directions besides forward and also to have it for dressed 
quasiparticles of energy €p rather than bare ones of energy €p 

The objective is to have a model for the scattering of two quasiparticles. The parameters 
of the model are chosen to reproduce the Fermi liquid theory for scattering in the forward 
direction. One method is to write an effective quasiparticle Hamiltonian with creation and 
destruction operators: 


H = Se = W Cro o t3 E L MOG p+q,s ci q,s' Ckv Cps (11.191) 


The matrix element M is different from M o, because M(q) does not include exchange 
events. In the theory of the homogeneous electron gas, which has a similar Hamiltonian, the 
interactions between particles led to a screening of the interactions. The word screening 
sometimes implies a charge redistribution, so the alternate word dressing is used in liquid 
He. But the physics of the two concepts is the same: the quasiparticles rearrange all the other 
quasiparticles in their vicinity to alter the effective interaction between any pair of them. In 
the random phase approximation, this effective interaction is 


M(q) 
1 — M(qg)P(q, o) 


where P\)) is the simple bubble contribution for the polarization diagram. The denominator 
has the dressing function 1 — MP“), which is equivalent to the dielectric screening function in 
the electron gas. the forward direction is obtained by first taking œ — 0, and afterwards 
q — 0: 


M.g(q, ©) = (11.192) 


PY — —Np 

M(q) = F: 
lim, (D = Fo/Np (11.193) 
0,q—> KY 

NM- NMO __Fi , 


~ 14+N,M(0) 1+F °° 


The effective interaction Fj is replaced by Aj. The quantity F$ is the s-wave part of the 
interaction between bare quasiparticles, while Aj is the same interaction between dressed 
quasiparticles. This difference is the same as between the bare Coulomb interaction e*/r and 
the screened interaction. The dressed interaction between quasiparticles is changed, from the 
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bare interaction, by the other quasiparticles which have also been influenced to alter their 
equilibrium configuration by the effective interaction between quasiparticles. 

Since the static limit of (11.193) has been obtained, the next step is to examine the 
dynamical predictions of this model. Consider the case whereby œ >> qup. For the electron 
gas in this same limit recall that 


2 
(1) _ 74 
P (q, w) —_ m* 02 


l _ FS k2 q’ c2 q? 

] M (1) __0 "°F — 14 

osii) DE) = dat =o 
2 _ Fokk _ Vro Fo 


AT 3m2 3 1+F/3 


(11.194) 


This result may be used to obtain the dressed susceptibility, which is the correlation function 
for density—density operators: 


BO 
Xa(q, iw) = -| dte™" (T,p(q, t)p(—q, 0)) (11.195) 


NrM(q) _ ag 


lim N ,O) = a = —— 
wave Kala O) l -NrM(q) œ -cig 


(11.196) 


There is a resonance at œ ~% c,q. The large value of F$ ensures that c, > vp so œ> vrq is valid 
when @ % cq. 

Table 11.2 shows that the exact sound velocity in the long-wavelength limit is 
c? = (veo /3)(1 + F8)/(1 + F$ /3). The approximate result in (11.196) is close to the exact c, 
since Fġ is large and there is not much difference between Fj and 1 + Fẹ. The factor of 
(1 + F}/3) comes from the effective mass. The exact density—density correlation function has 
the exact same form as (11.196) but with the correct speed of sound c rather than the 
approximate result c,. The difference between these two calculations is that the long-wave- 
length sound velocity c is found in the hydrodynamic regime and is the first sound. The above 
calculation is for the collisionless regime, and c, is the velocity of zero sound. They approach 
the same value for large F$. The density—density correlation function has the long-wavelength 
limit of sound waves, which dominates the excitation spectrum. 

Define Apopo aS the Fermi liquid theory function which gives the interaction between 
dressed quasiparticles. It is also the forward scattering amplitude of the screened matrix 
element for the interaction between two quasiparticles. It may be derived from the bare 
interaction between quasiparticles f,, po by the following simple argument. The self-energy 
of a quasiparticle may be expressed as either the product of the bare interaction f,, yo’, and 
the bare additional density 5n,,, or else the product of the dressed interaction Ay, 94° and the 
dressed additional density 6n,,: 


Ep — Ep = 2 Soo.p'c' My = 2 Apopo Dipo (1 1.197) 
oO 


The interaction between dressed quasiparticles A, pq’ 1s also defined only right on the Fermi 
surface, where both p and p’ are equal to kp. All quantities are expanded in spherical 
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harmonics, and doing the angular integrals gives the following obvious formula relating these 
coefficients: 


1 
Apo, po = mA SA + o + o'A7]P)(cos 8) (11.198) 
Fl 
AF on! = FP Sng (11.199) 


~5S,a 


Using the previous relationship (11.180) between 67,7" and 6n7;", the final formula is deduced 
between A, and F; 
s Fi 
Ai = —.——~ 
1 +F7/RI+ 1) 
a Fr 
Al = a 
1+ F?/(21 + 1) 


(11.200) 
(11.201) 


The case / = 0 agrees with the RPA result obtained in (11.193). These numbers are tabulated 
in Table 11.3. The effective interaction between dressed quasiparticles is qualitatively much 
different than the interaction between bare quasiparticles. The very large value of fj) makes 49 
less than unity, so the dressing effects have a drastic effect on reducing the effective inter- 
action. Of course, the same thing happens for electron—electron interactions in metals, where 
screening makes a similar reduction. But the most interesting effect is the increase in the spin- 
dependent parts 4f. These become larger in magnitude than F7 because the latter are negative. 
This enhancement of the spin-dependent interaction is one step in the argument, which is 
given later, toward the conclusion that superfluidity in liquid *He is caused by triplet pairing. 

The next question is the method of calculating the spin susceptibility. In (11.194) and 
(11.196), RPA and Fermi liquid parameters were used to derive an approximate formula for 
the density—density correlation function. Is it possible to do the same for the spin suscept- 
ibility? By reasoning deductively, Leggett (1965) suggested that a possible form for the spin 
susceptibility in the RPA is 


(1) 
Lla o) = (11.202) 


F§/Nr)PO(q, o) 
This equation is deduced, in analogy with (11.192) and (11.193), by just replacing Fj by F8. 
A better treatment would start from a model Hamiltonian which would give this result for the 
correlation function. For the moment, accept (11.202) as a reasonable hypothesis and examine 
its dynamical predictions. 

The spectral function for this retarded correlation function is called S,(q, œ). In analogy 
with the electron gas, two types of dynamical modes are expected from this correlation 
function. The first is a collective mode equivalent to the plasmon in the electron gas and the 
sound waves in the density—density correlation function. Such modes can exist only when 
© >> qvr. The theory has already been done for the sound case, in (11.194) and (11.196), so 
change fF to Fy in these equations. The prediction is that the collective spin sound has the 
elgenfrequency 


a 


F 
O = qVF 3 (11.203) 


This frequency is imaginary, since F¢ is negative. These modes are totally damped and do not 
exist as excitations. 
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The other excitation of the fermion gas is the particle-hole pairs. The theory of spin 
pairs can be developed in an analogous fashion. Consider the case where œ < qu; and q < kp. 
Then the polarization function P“) can be approximated for its real and imaginary parts by 


Pg, ©) = Nel 1 -7 (2 )eavr - o) 
WF © (11.204) 
S.(q, o) = —2 Im[x,(q, œ)] = hs 2 4 o2 OPF — o) 
2 qUr 
=é 11.205 
T144 (11.205) 
p = NFO _ (11.206) 


o FSA + F9) 


There are no collective modes since the pole in the denominator is found at imaginary 
frequency. The spectral function is linear in energy for small values of E = hw. The same 
linear behavior was found for the electron gas when the pair contribution to Im[1/&(q, @)] was 
evaluated, as in Fig. 5.12. The same theory can be applied to the pair spectrum from the 
density—density correlation function, and the only difference is in replacing Fj by Fẹ and 45 
by A}. It is instructive to compare these two cases when pairs are made by either the spin or 
density response functions. The energy widths œ, and œ, are similar for the two cases, since 
Ap and Ap differ only by a factor of 2. However, the significant difference is in the coupling 
strength A, ,. The factor |Fy’“(1 + F5°*)|~' is quite different for the two cases, since it is 4.5 
for spins and 0.009 for density, which differ by a factor of 500. The RPA predicts that the 
spectral function for making pairs from the spin correlation function is 10° larger than for the 
density correlation function. 

The correlation function (11.202) must be regarded as ad hoc without a derivation based 
on a realistic model Hamiltonian for the interaction between quasiparticles. The only attempt 
in this direction has been using paramagnon theory, which is briefly described. This theory 
starts with a model Hamiltonian for quasiparticles of the form (11.191), with the matrix 
element M only scattering particles of opposite spin. This matrix element is taken to be 
independent of momentum and is denoted by the symbol /: 


V = r [n@m dr (11.207) 
-sð cco 11.208 
= č, k+qt “kt ~p-q4 ~p} (11.208) 


This interaction term was first suggested by Berk and Schrieffer (1966) in their discussion of 
the superconductivity effects in transition metals, such as Pd, which are nearly ferromagnetic. 
They are actually paramagnetic, with a large spin susceptibility and a low transition 
temperature for superconductivity. They reasoned that the spin arrangements in Pd have short- 
range correlation, whereby if any spin was pointing in a direction, then nearby spins were 
highly correlated and were likely pointing in the same direction. The spin arrangements were 
ordered locally, although for short durations, and these spin fluctuations are called para- 
magnons. They proposed that these spin fluctuations suppressed the onset of super- 
conductivity, since an electron with the spin pointing in the opposite direction would find it 
hard to be in the same vicinity because of Coulomb repulsion. They constructed this ad hoc 
Hamiltonian, which had a point repulsion (J >0) between particles of opposite spin. 
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This idea was adopted immediately for liquid >He by Doniach and Engelsberg (1966) 
and Rice (1967). They used it to calculate a number of properties of liquid >He. Levin and 
Valls (1978) show that paramagnon theory gives an accurate quantitative theory of dressed 
quasiparticle interaction for both superfluidity and quasiparticle transport. 

An alert reader will notice that the above Hamiltonian is identical to the Hubbard model 
which is discussed in Chapter 6. Of course, the Hubbard model was for a lattice, while here it 
is for a liquid. In the wave vector representation, the two theories are identical. The reader can 
refer back to Chapter 6 for the predictions of this model Hamiltonian. 


11.2.4. Quasiparticle Transport 


Fermi liquid theory can also be used to describe the transport of quasiparticles at very 
low temperatures. The agreement between theory and experiment provides a further verifi- 
cation of Fermi liquid theory and the derived parameters. The theoretical framework was 
provided by Abrikosov and Khalatnikov (1959), who presented an approximate theory of 
viscosity n and thermal conductivity K. The third transport coefficient is spin diffusion D, and 
an approximate theory was first given by Hone (1961). All these theories involve the solution 
of the Boltzmann equation for transport in Fermi systems, where the scattering mechanism is 
particle—particle interactions. In this case the particles are quasiparticles. Later there was an 
exact solution of the Boltzmann equation in the limit where T — 0 by Jensen et al. (1968) 
and Brooker and Sykes (1968). It would take us too far afield to solve the Boltzmann equation 
for these three transport coefficients, so the results are just quoted. The following formulas are 
exact in the limit of zero temperature: 


_ 8n?kp [/W(, p) — cos6)\] 
sre N oa On 


O 64KE W(8, )(1 — cos 6)? sin? \] 
1 450m) kT) ( cos(0/2) )| oe) 


_ 32n?ke( + FG) (Put p) = cos 8)( = cos ) MT can 


—— 3(m*) (ke TY cos(0/2) 
3— À 2 4n +5 
l1- x 4n +3 
C= Ta + Da ++ D[(n + 12a +1) A] 
1-32 (W(8, o)(1 — cos 0)/ cos(6/2)) 
K (W(8, $)/ cos(8/2)) 
7 3 (W(8, b)(1 — cos 8) sin? / cos(6/2)) 


n 4 (W(8, $)/ cos(8/2)) 


_ 1(W4,@, p)(1 — cos 8)’ sin? / cos(8/2)) 


w= 15 (W(, 6)/ c0s(6/2)) 
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The functions H(A) and C(A) are correction factors. The original solution of Abrikosov, 
Khalatnikov, and Hone had an approximate result with these quantities set equal to unity. In 
the exact solution, one also has to evaluate these correction terms. The first step is to find the 
appropriate à and then to use it in the series for H(A) or C(A). Both series converge rapidly, so 
that the answer is obtained easily. The factor C turns out to be about 0.8, so that the older 
theories are reduced by about 20%. However, the correction for the thermal conductivity is 
much larger, since H(A, ) is almost exactly 0.5, so the older expression erred by a factor of 2. 
These correction factors, resulting from the exact solution, play an important role in 
improving the agreement between theory and experiment. 

These formulas contain one or several brackets which have a factor W(0, d) plus some 
angular terms. This average is taken over the 4r solid angle: 


1 — cos 8 1 [7 " 1 — cos ð 
W——_)=—]| d dO sin(®)W(8, p) ———— 11.210 

(" aay) n, 20), 9.0) Sem 20 
The angles 0 and © are explained later. The symbol W(0, ¢) is the matrix element for the 
scattering of quasiparticles. It is conventionally defined as 


WO, $) == [14 O, OI? + 414440, OF] 


On (11.211) 

W, (9, p) = A |4}, (0, )|? 
where A,, and A}, are the matrix elements for the scattering of two spin-up particles, and a 
spin-up with a spin-down particle. These matrix elements are given by the Fermi liquid 
parameters. The factor (5) in front of |A,, 7 occurs because the scattering of identical 
quasiparticles into (k, —k) is indistinguishable from (—k, k), and the angular average counts 
both of these and so overcounts the scattering by a factor of 2. 

The factors in the numerators of (11.209) come from the effective lifetime appropriate 
to the transport coefficient. For the thermal conductivity, the effective lifetime contains the 
factor (1 — cos), just as it does for the electrical conductivity in metals. The other factor in 
this angular average is 1/cos(8/2), whose origin is now described. First examine the 
expression for the lifetime of a quasiparticle as it scatters from the other quasiparticles. 
Assume that the initial quasiparticle has momentum p}, that it scatters from a quasiparticle p,, 
and that they go to p, and p,. The lifetime for this scattering rate is written in a symmetical 
form 


l 2T | d'p ıd’ pd’ p, 2793 
—— = — | — E Mv (22) 8(p, +p — p3 — Pa) 
p) A OT a ( (P: + P2 — P3 — P4 


x ole + E& — £3 — e4)n(1 — n3)(1 — n4) (11.212) 


The occupation factors n; = np(£;) express the feature that p, is in an occupied state, while p, 
and p, must go to empty ones. The integrand contains the delta functions for conservation of 
energy and momentum, and the matrix element |44 |? for scattering (but [Ay 7 is divided by 
two). The evaluation of this quantity follows Pines and Nozieres (1966). 

At very low temperature, all the quasiparticles will be quite close to the Fermi surface. 
The magnitude of all the momentums p,(j = 1, 2, 3, 4) are very near to kp and therefore very 
nearly the same length. This feature simplifies the calculation. The delta function for 
momentum conservation is used to eliminate the d°p, integral and leaves the combination 
d°pyd*p;. Let P = p, + p, be the center of mass momentum of the two quasiparticles, which 
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is unchanged by the collision. The d°p, integral is nested inside d?p, so that the vector P is 
used as the basis of choosing the angles in doing the d°p, integral. In particular, let 
P-p; = Pp, cos(03), cos(0;) = v3. Then the vector p, has the magnitude 


p4 = |P — p3| = ./P? + p} — 2Pp393 (11.213) 


This identity is used to change the variables of integration from d0, sin(0}) = dv, to dp,, 
where pdv, = —p,dp,/P. The next step is to define P in terms of the variables used in the 
d?p, integral. Use 8 to denote the angle between p, and p, and use the fact that both are near 
kp to find P= |p; + pol = Jp? +p? + 2p,p, cos(®) © kp,/2(1 + cos 8) = 2kp cos(0/2). 
Finally, we set p3dp, ~ kpd(p3)/2 which brings us to the identity 


3. 3. z «3 sin(0)d0 
d pid p; = (m°) 5 


aa l. 
rc0s(0/2)) 1024 3 e24esd es (11.214) 


The six integration variables in d*p,d*p, have been reexpressed in terms of coordinates which 
are more useful. It is usually assumed that the matrix element has no significant energy 
variation. Then the energy integrals can be done exactly. The first one is trivial, since it just 
uses the delta function for energy conservation. The remaining two integrals can also be done 
exactly, which is assisted by the indicated variable changes: 


I = | dezdesde,8(e + & — & — &4)Np(E,)Np(—&3)np(—E,) (11.215) 
I= | dezdesnp(e2)nr(—es)e(e — £; — £) (11.216) 
x= PE) y= ePles—H) z = e bE H) (11.217) 
I = (kgTY | ax | DE (11.218) 
= (kT) are (11.219) 
=- (5) = 4 (1 — min? (kT) + (1 — WI (11.220) 


The last integral is from G&R, 4.232(3). The form of the answer is very interesting. The result 
is proportional to two terms: one is (tkgT)’ and the other is (€; — u)*. The latter vanishes at 
the chemical potential €; = p and the former vanishes at zero temperature. This form of the 
answer is found in particle—particle scattering of any like fermions. The same factor enters 
into the expression for the inverse lifetime of an electron in a metal, from electron—electron 
scattering. 

The last step is to do the angular integrals. First it is helpful to visualize the coordinate 
system. Take the case where all four vectors p, have the same length kp. Define 0 as the angle 
between p, and p,. It is also the angle between p, and p,, which is true only when all four 
vectors are the same length. Only one more angle is needed to define the relative orientation 
of all four vectors. This angle could be the relative orientation of p, and p;, but this choice is 
not conventional. Instead, note that the two initial vectors p, and p, form a plane whose 
orientation is represented by a unit vector A; in the direction p, X p». Similarly, the scattering 
plane in the final state has an orientation ñp in the direction p, X p4. Define ọ as the angle 
between these unit vectors, cos ọ = ñ; ñp. The angle ọ is shown in the vector diagram in Fig. 
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FIGURE 11.14 Angular variables in quasi particle scattering in liquid *He. 


11.14. All four vectors p, make an angle 0/2 with respect to the center of mass momenta P, 
and ọ is the angle between the two scattering planes. 

In the variable list in (11.214), the azimuthal angle 0, is actually o, while , is 
unnecessary and may be integrated to give 27. The description of the angles in the scattering 
process is now complete . The final lifetime is the following expression from Pines and 
Nozieres (1966): 


1 (m*y 


(pi) 16n4%° 


(1 —n,)[n?(kgT)° + (€; — oa} (11.221) 
The symbol W(8, ġ) has been given its definition in (11.211), where all the angular variables 
have been defined explicitly. The result for 1/t(p,) has two terms which show that at the 
chemical potential ¢, = ų the lifetime is determined by the temperature, but at high energies 
B(e, — p) > 1 it is determined by phase space availability. The equivalent quantity using 
Green’s functions is —2 Im[2X(p,, €,)] for the appropriate diagram. The imaginary part of the 
self-energy produces the same lifetime if the diagrams are selected correctly, except the 
(1 — n,) factor is absent (see Problem 7 at the end of this chapter). The factor (1 — n,) should 
not be included in the quasiparticle lifetime. 

The evaluation of the angular integrals for the quasiparticle lifetime require an expres- 
sion for W(8, o). The Landau—Fermi liquid theory gives only the forward part of the scat- 
tering amplitude, where p} = p,, P4 = Pp», or the opposite. Forward scattering corresponds to 
the case where = 0 or n. A method is needed of extending this scattering theory to other 
values of angle besides the forward direction. The extension to other angles is useful not only 
for this calculation of the quasiparticle lifetime but also for the similar calculations of the 
transport coefficients in (11.209) which have other angular averages of W(8, @). In transport 
theory, the quasiparticles not only scatter in the forward direction but to any other point on the 
Fermi surface. How can this be described by using only known parameters? 

An ingenious and very successful method for this was proposed by Dy and Pethick 
(1969), which they called the s-p approximation. The goal is to obtain the scattering 
amplitudes A,,(9, >) and A, (9, p) for the scattering of two spin-up particles and a spin-up 
with a spin-down particle. If two particles both have spin up, they must be in a relative triplet 
spin state, and this scattering amplitude is labeled A,(0, @). The singlet scattering amplitude is 
called A,(8, ġ). Two particles in opposite spin states have equal likelihood of being in a 
singlet or triplet state: 


A, 00, $) = 41, (0, 6) = 4,0, 4) (11.222) 
A700, $) = 4,100, >) = 414,00, 6) + 4,0, 0)] (11.223) 


The next step is to use the symmetry of the two-particle wave function to deduce the possible 
angular variations. In center of mass coordinates, the only relevant angular variable in the 
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scattering process is that between the initial and final relative momenta, which is p. Singlet 
states must be symmetric under the exchange of the orbital part of the particle coordinates, so 
only even values of relative angular momentum are allowed, as explained in Sec. 11.1. 
Similarly, triplet spin states permit only odd values of relative angular momentum, so that the 
symmetry of the two-particle wave functions dictates the following possible choices for these 
scattering amplitudes: 


A,(8, >) = >| C,(8)P;(cos >) (11.224) 
I odd 

A,(8,>) = >) C)(8)P;(cos >) (11.225) 
l even 


The next step is the s-p approximation. Dy and Pethick argue that since only the / = 0 and 1 
values are known in the spherical harmonic expansion for 8, consistency dictates that only 
similar terms are retained in the expansion for Legendre functions on the @ variable. In this 
case there is only one term allowed in each expansion, which is / = 0 for singlet states and 
I = 1 for triplet states: 


A,(8, p) = C,(8) cos(o) (11.226) 
A,(8, $) = CA0) (11.227) 
The next step in the derivation is to find the amplitudes C, (0) and Co(8) by evaluating them in 


the forward direction, where the answer is known. Set ọ = 0 and equate these coefficients to 
the interaction term between dressed quasiparticles: 


1 1 
C, (0) = 4,(0, 0) = K Lai + A*)P,(cos 0) (11.228) 
C)(0) = A,(8, 0) = F za; — 3A?)P,(cos 8) (11.229) 


These various steps are collected together to finally provide the angular variation for the 
scattering amplitude between dressed quasiparticles: 


COS 
Nr 


l 
A, (0, 0) = IN, {cos p[(40 + 40) + (41 + 41) cos 0] 


Ay, (0, o) = 


[(40 + 40) + (Aj + 41) cos 0] (11.230) 


+ [(49 — 340) + (41 — 347) cos 0]} (11.231) 


In older calculations of the transport coefficients, the angular functions A,,,(8, >) were 
approximated by the results in the forward direction @ = 0, since they are given by the 
Landau Fermi liquid parameters. The improvement of Dy and Pethick is to add the factor 
cos > to the amplitude for the triplet scattering. The factor of cos is a necessary addition 
from the point of symmetry, since the triplet amplitude must change sign when 6 = 7, 
because of the wave function antisymmetry. Going from 6 = 0 to ọ = t effectively changes 
the scattering (P; —> P3, P2 — p4) to the exchange event (Pp; —> P4, P2 —> P3). This small 
change of adding cos © is a big one insofar as improving the agreement between theory and 
experiment for the transport coefficients. Table 11.4 shows their calculation of the quantities 
KT, DT’, and nT’, as compared with the experimental values, using the theoretical formulas 
in (11.209). The agreement is obviously excellent. Previous calculations without the cos p 
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TABLE 11.4 *He transport results as T —> 0 


KT (erg/cm-s) DT? (cm?-K?/s) nT? (poise-K”) 
Experiment’ 35 1.4x10~° 1.8xlo~° 
Theory? 33 1.6x10~° 1.6x10~° 


a. Wheatley (1975). 
b. Dy and Pethick (1969). 


factor in A, disagreed with experiment by a factor of two. Dy and Pethick also predicted the 
transport results for higher temperatures. 

Another important success of the s-p approximation has been to explain the superfluid 
properties of liquid *He. In particular, it explains why the BCS type of pairing is not in the 
singlet state but rather is in the spin triplet state. Here the discussion follows Patton and 
Zaringhalam (1975). They calculated the temperature at which the Cooper instability occurs 
in liquid *He, which is a property of the normal state of the liquid. The construction of a wave 
function for the superfluid state is a different step, which is done in the next section. But the 
Cooper instability is a property of the normal fluid and is discussed here. 

In a BCS of pairing state, two dressed quasiparticles of momentum p and —p are 
coupled into a collective bound state. One important question is whether this pairing occurs 
with the two spins in a relative singlet or triplet state. To answer this question, examine the 
effective interaction for each configuration. Consider the scattering of a bound pair with 
(p, —p) into another pair state (p’, —p’). The two scattering particles have 8 = r, so that the 
relevant amplitudes are 


A(T, 6) = CiP +! (11.232) 
A(T, $) = Ch(m) (11.233) 


The Fermi liquid parameters are used to derive C, (r) and Co(m) from (11.229): 


N;C (m) = —2.57 (11.234) 
NpCh() = 3.35 (11.235) 


The singlet state amplitude C(7) is positive. Two dressed quasiparticles have a repulsive 
interaction when paired in the spin singlet state. They do not attract each other and do not 
form bound states. The superfluid state is not a spin singlet in liquid *He. 

For the spin triplet state, C} (7) is negative, which shows that the two quasiparticles have 
an attractive interaction. Pairing is possible in this state, and the normal fluid will show a 
Cooper instability at a nonzero temperature because of this tendency toward triplet pairing. 

The prediction of Fermi liquid theory is that liquid >He has to form a BCS pairing state 
in the spin triplet arrangement. This prediction seems in good accord with the experiments on 
the superfluid state, which are described in the next section. The triplet pairing is a conse- 
quence of the negative value of C,(m) and a positive value of Co(7). The physics question is: 
Why does this happen? The answer seems poorly understood. The parameters C, and Cp are 
derived from the interaction between dressed quasiparticles, which in turn are derived from 
the interaction between bare quasiparticles. The latter numbers are taken from experiment, 
although the microscopic theory for them gives good results. However, the chain of argument 
starting at C; and Cp runs back to other parameters (F;" ) which are hard to calculate. It is 
difficult to have any intuitive insight into any of these parameters. 
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11.2.5. Superfluid *He 


Superfluidity in liquid >He was discovered by Osheroff et al. (1972), who discussed it in 
a series of papers. Their Nobel Lectures are a nice review: Lee et al. (1997). They identified 
two phases in the liquid, which are called the A and B phases. The phase boundary on a 
pressure vs. temperature curve is shown in Fig. 11.15 as measured by Greywall (1986). The 
line marked T, separates normal liquid *He from the superfluid phases. It is a second-order 
phase boundary with a jump in the specific heat, which are both characteristics of a BCS state. 
Both A and B phases are superfluid and are experimentally different. The line marked 7, is a 
first-order phase boundary. The location of this phase boundary is strongly affected by small 
magnetic fields. Indeed, both the A and B phases have interesting magnetic properties which 
reinforce the notion that pairing is in a spin triplet state and that the pairs have a net magnetic 
moment. The theory of triplet pairing is given below. 

The theory of superconductivity for electron spins in a triplet state was derived by Balian 
and Werthamer (1963). They developed this theory in the expectation that some metals might 
be superconducting with this spin configuration. The first important application of their result 
is in explaining superfluidity in liquid *He. Their theory is similar to the BCS theory, with 
weak coupling between the pairs of fermions, except that the spins are assumed to be in a 
relative triplet state. However, triplet pairing makes a nontrivial change in the nature of the 
answer because of the additional spin degrees of freedom. The spin state (S = 1, m, = 1,0, —1) 
can project in different ways on the relative orbital motion (L = 1, m; = 1, 0, —1). The paired 
states have a number of possible values of total angular momentum J = L + S. The gap 
equation is a matrix equation. It has a number of possible solutions, and the order parameter is 
actually a spinor of dimensionality 7. This feature leads to a rich description of the types of 
motion and ordering of the superfluid. All this complexity is just to describe one of the two 
phases. The theory of Balian and Werthamer is believed to describe the B phase of the 
superfluid. The A phase is another type of triplet pairing, which is treated later. 


P(bar) 


superfluid 


B-phase 


normat Fermi liquid 


0 0.5 2.5 3 


1.5 2 

T(mk) 
FIGURE 11.15 Phase diagram of *He as a function of pressure and temperature. The line T, divides normal from 
superfluid phases. The line T}; divides the two superfluid phases, and the horizontal line at 34 bars divides solid from 
liquid. Source: Greywall (1986) (used with permission) 
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The BW (Balian and Werthamer) theory is solved by following the same steps used to 
derive the BCS theory in Chapter 10. The first step is to write an effective Hamiltonian 
between dressed quasiparticles: 


H = X bp Cho Cp -+> TP VigCs sac ch -av Cpo Cpo (11.236) 


The interaction potential V(q) is actually a function of the momentum change between 
relative wave vectors p; and py in the scattering process, where Pr = P; + q. The interaction 
potential is expanded in spherical harmonics, and the first two terms are 


V (py - p; = 3 Vi(pi, Pp Pi; *Pr) = Vo + Vip; "Dr (11.237) 


The first term Vo acts when the orbital motion of the two particles is in a relative s state, which 
happens for a spin singlet. The discussion of the previous section showed the s-wave inter- 
action is repulsive (Vj) >0) and cannot cause bound states. The second term Vp; pp is for 
relative p states and applies for spin triplets. This coefficient is negative and can cause pairing, 
where Vi ~ C(t) in the notation of the previous section. This term is the only one that is 
kept in the interaction potential. It has the feature that it is antisymmetric in either of the 
momentum variables: V(p;, Py) = VP; * Py = —V(—P;, Pp) = —V(p;, —py). In the case of 
weak coupling theory, such as BCS, this potential is assumed to be a constant V, in energy up 
to a cutoff w, from the chemical potential. The physical nature of this cutoff is not clear. 
Obviously, a better theory is impossible without a better understanding of the retarded nature 
of the potentials between dressed or bare quasiparticles. 

The next step is to define the correlation functions appropriate to the superfluid. They 
will be of the same type we used in the BCS theory of superconductivity. One expects 
superfluidity in liquid >He to be similar to a BCS state because of the obvious experimental 
similarities. The theory requires the introduction of seven correlation functions: 


Gp,t —v) = —(T,Cy(t)Ch,(7)) 
F(p, T- T’) = (T,C_p(1)C,,(7)) 
F ((p, T— T) = (T,C_p,(1)Cp (7) 

F _\(p,t—V) = (T,C_ py (Cy i’) 
(T, 
( 
( 


) 

) (11.238) 
F\(p,t—v) = (T, OCE (7) 

) 

) 


$ 


F y(p,t—v) = (T, k OCC) 
F'(p.t-v) = reod pi’) 


The quasiparticle Green’s function ¢ has the usual definition, although it will have a different 
form in the superfluid state. The others are pairing functions for different spin arrangements. 
The first one has both spins up and belongs to the triplet state. The next has one spin up and 
another down, and appears identical to the correlation function used for the BCS singlet state. 
However, now it is for the spin triplet state and represents the configuration where 
S = 1, m, = 0, whereas in Chapter 10 it was the correlation function for S = 0, m, = 0. The 
difference between these two correlation functions becomes clearer when they are written as 


F 51 m=0(P, T — T) = —F1(T,C_pp Cp) + (T,C_py OC (7))] 


— oo (11.239) 
F s=0,m=0(P; T = T) = — 3 iT:C-p10)Cp (’)) — (T:C-p 0C CE 
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The first one is for the spin triplet state and the second one for the spin singlet. The triplet 
state is antisymmetric in momentum, while the singlet state is symmetric: 


S=1,m-0(—P, T— T’) = —F s1 m=0P, t— T’) 


, , (11.240) 
S=0,m=0(— P, T—T ) = F s=0,m=0P; tT — T) 


S S 


These parity relations are easily proved from the definitions (11.239), by changing the sign on 
p and commuting the operators. The simpler definition in (11.238) can be used for either 
triplet or singlet with the appropriate choice of symmetry (11.240). The simple definition is 
used, and only the antisymmetric version is used when solving the equations self-consistently 
for the correlation functions of the triplet state. 

There are four possible spin configurations for the two spin one-half quasiparticles: three 
triplets and one singlet. The most general description would have four correlation functions 
F s,m, (one for each spin arrangement) plus %, and the gap equation becomes a 5 x 5 matrix 
equation. The problem can be simplified, at the outset, to a matrix equation which is only 
3 x 3. The simplification is achieved by using the relationships in the previous paragraph, 
where m, = —1 is the Hermitian conjugate of m, = 1 and only one m, = 0 function is used 
with momentum antisymmetry. The simplification to a matrix of dimension 3 is obviously 
desirable and is conventional. 

The equations of motion method is used to obtain the gap equation in the weak coupling 
theory. The first step is to recall (10.36) for 0C,,/0t, which is then put into the t derivative of 
the Green’s functions: 


o 

7 
° G yah V 

- (5 +6) 92-2455 ro 

x (TC) gop (1)Cp-q,0(1)C}o (T) = o(t — T’) (11.242) 


l 
Cpo = —ÉpCpo — 7%, VqCh ao Cpo Cp-a,o (11.241) 


The interaction term has an expectation of four operators. These are paired in different 
arrangements, which produces a large number of terms, since pairings are allowed between 
like and unlike operators (i.e., (CC), (C'CT), (CCT)) with parallel and antiparallel spin 
arrangements. In weak coupling theory all terms are ignored except those which contain a gap 
function ¥,,. In the preceding equation, the terms which are retained are those with the 
pairing (Ciao Cho) (Cpo Cp-qs)- There are two spin arrangements, o’ = © and o’ = —o, 
which gives the first equation of motion 


-(ž + 90 t—T)— ~ Fp. t— 7) >) V(q)F \(p — q, 0) 
q 


—*Fh(p.t 1) V@)Folp— 4.0) = 80-7) (11.243) 
q 
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Equations of motion are also needed for F (p, t) and F o(p, t). They can both be done at the 
same time by considering 


F (P; t —V) = (T,C_po(t)Cy,(7)) (11.244) 
ð ~ 1 
(eth) Fem e) +5 E ra 
x (T,C_po(t) Chg oT )Cyg()Cp_g,4(V)) = 0 (11.245) 


where o = +1 gives the two gap functions. In the bracket of four operators, pair the first two 
operators to get a Green’s function Y(—p, t — T) = Y(p, tT — T), which sets o = o’. The 
other two operators are paired into a gap function. The pairings which lead to Hartree and 
exchange energies such as (C_poCpre' (Chao Cp—qt) are omitted in weak coupling theory. 
The pairing terms which are kept give the following two equations of motion: 


(5 + p) Fe T=T)— Lap, t—1))/ VAF ip — 4, 0) = 0 
q 
l 
(2 + g F, t=) ——G(p, 1-1) E VFP — 4, 0) =0 
q 


These are now Fourier-transformed in the usual way to obtain functions of Matsubara 
frequency. Then they become algebraic equations which can be solved. Important terms in 
these equations are the gap functions, which are defined as 


A\(p) = -i5 Vp- p)F (p, t= 0) 
7 


| (11.246) 
Ao(P) = -72 VP — p')F oP, t = 0) 
5 
The equations of motion are 
(ip — Ep) IP) + Fi@)A; + FiP)A = 1 (11.247) 
(ip + 6p) F mP) + Gp)A,, = 0 (11.248) 
(ip + E,)F |p) + G(p)Al, = 0 (11.249) 


The equation is also given for the Hermitian conjugate ¥ t, Then the solution is straight- 
forward: 


E(p) = & + |A,(p)? + |Ao(p)/’ (11.250) 
ip+€ 

Gp) = — —— P 11.251 

p) P + EY l ) 

Fie gp (py Îm 11.252 

F nO) = F mnp) LEY (11.252) 


The effective gap function is A(p) = ,/|A,|? + |Ao|? and the energy gap E, = 2A. In some 
respects, the BW state appears to be similar to a BCS singlet state but with an effective energy 
gap of A(p) and excitation energy E(p). This similarity does not extend to magnetic 
phenomena, such as spin susceptibility, which are much more interesting in triplet pairing. 
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The last step in deriving the self-consistent theory is to obtain the equation for the gap 
function. The correlation functions F t (p, t) are found for t = 0, with the familiar result 


1 A,,(p) tanh[BE(p)/2] 


get — f)) — get À — 
__ [EP n Am(p’) tanh[BE(p')/2] 


In the second equation, the definition (11.246) is used to derive the self-consistent equation 
for the gap function. This gap function must be solved for the energy gaps A,,(p) 

The gap equation is obviously a nonlinear matrix equation. Each term in the matrix 
depends on the excitation energy E(p), which depends on the two energy gaps Ay and A,. The 
above is a set of coupled equations for A, and A,. There is also the restriction that the gap 
function is an odd function of momentum, A,(—p) = —A,,(p), which follows from the 
condition that ¥,,(p) have this property. This condition is automatically satisfied when the 
interaction potential is taken to be the p-wave type V = Vp-p’. For example, by changing 
dummy variables of integration in the gap equation from p’ to —p’, the equation is unchanged 
when both V(p, p^) and A,,(p’) are odd functions of p’. 

The interesting aspect of the gap equation in triplet pairing is that the solutions are not 
unique but have orbital degeneracy. There are two sources of angular momentum from each 
pair of particles: the spin angular momentum and the orbital angular momentum in the p state. 
The first question is the relative orientation of these angular momenta. If two particles were 
binding in a gas to form an He, molecule, then one could just couple these momenta with 
Clebsch—Gordon coefficients into all possible values of J = 0,1, and 2. In the Fermi liquid, 
the superfluid state is a collective property, and each pair cannot be treated as individuals. 
Nevertheless, there is still some freedom in selecting the basis for the spin and the orbital 
angular momentum. This idea is best illustrated by an example. 

The gap function A,,(p) must have p-wave symmetry. The obvious configuration has the 
spin and orbital states aligned along the same axis. The first solution described below was 
found by BW. For the gap function A,,(p) they chose the orbital function according to 
Y\m(9,, Pp), where (0,, ,) are the angular orientation of the p vectors. Specifically, 


A..\(p) = (Ó + iB, )A(p) = sin(0,)e*"* A(p) (11.255) 
Ao(p) = ĝ;-A(p) = cos(8,)A(p) (11.256) 
|A(p)|? = |Aol? + 1A, 2? = Ap? (11.257) 


The functions E(p) and A(p) are even functions of p and are scalar. 

First verify that this choice satisfies the gap equation. Use the angles (0, ) for the 
direction of p and the variables (0', b^) for p’. From the law of cosines, the angle between p 
and p’ is 


pp =cos@cos6' + sin 8 sin @ cos( — p’) 


[A Ar os h id’ A í 
A, = -TEA fao -p' sin(0')e”® | a Fo an| EEE | 


— NEM ag A(p’) BE(p’) 
Aj = — Tr fao Fo anh 5 | 


(11.258) 


> 


- p' cos(6’) | ae 
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The energy gap E(p’) does not depend on the angles. The angular integrals in the gap equation 
are easy to do and give (g, = —NpV,/12 = —C,/24>0) 


A4; = Asin(®)e*® (11.259) 
Ay = Acos(8) (11.260) . 
ft AG’) BE’) 
A=g, B dE a tanh 5 | (11.261) 


For each gap function A,,(p), the angular functions are the same on both sides of the equation. 
These can be canceled from both sides, and each equation produces the same formula for the 


effective gap function: 
Ap) =A =g, D dé no tanh | PEE 2 (11.262) 


The equation is identical to the BCS equation for the singlet gap. The coupling constant is 
effectively g, This equation is easiest to solve at zero temperature where the hyperbolic 
tangent is unity, A = constant: 


Mc dé 2 
l= z| I nag in x) (11.263) 
l 
A = 2 —— 11.264 
Alar = 20. exp( -75 (11.264) 


The solution is an equation of the BCS type for the gap function |A| at zero temperature. The 
variation with temperature below T, is also similar to the BCS prediction. 

This equation describes the Balian—Werthamer state. Their theory is thought to apply to 
the B state of superfluid liquid *He, although the theory must be modified to account for the 
strong coupling between quasiparticles. However, even the weak coupling theory, as just 
derived, makes a number of predictions which agree with the experimental findings in the 
superfluid B state. Some of these will now be described. 

One important property is that the B state is isotropic, which is defined as having the 
energy gap A(p) not depend on angle in the superfluid. The best way to understand this 
definition is to give an example of a state which is nonisotropic. As a random choice, consider 
the possibility 


A, =A_, =0 (11.265) 
Ao(p) = V3 cos(8)A(p) (11.266) 


E(p) = ,/& + 3 cos? 0A(p) (11.267) 


The p-wave pairing requires that A,,(p) always depend on a p-symmetry angular function. 
The choice in (11.266) provides an entirely self-consistent solution to the gap equation 
(11.253). The normalization factor ./3 is chosen so that A(p) averages to A(p)* around the 
Fermi surface. The energy gap is no longer isotropic but now depends on the angle 0 between 
the spin and orbital motion. The energy gap and excitation energy will vary around the Fermi 
surface. A more detailed discussion of nonisotropic superfluid states is presented below. The 
motivation is provided by the superfluid A phase, which is thought to be a nonisotropic state. 

The BW state (11.256) chose axes where the coordinate bases for the spin and orbital 
motions were aligned. There is no need to make this choice, and the same gap equation for 
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A(p) (and the same solution) are obtained if these axes are rotated with respect to each other. 
For example, give the orbital coordinates a 90° rotation, and the gap functions are 


Ai = Ê, 6, AQ) (11.268) 
Ay = p,A(p) (11.269) 
|A(p)|* = Ap) (11.270) 


The gap function is isotropic and given by the same value (11.264) as before. In weak 
coupling the choice of orbital coordinates does not influence the energy gap. The BW 
superfluid state has orbital degeneracy in that the orbital motion is not coupled to the spin 
motion and can proceed independently. This feature depends on the isotropic nature of the 
superfluid phase. 

Many of the thermodynamic properties of the superfluid phase depend only on the 
energy gap. One example is the specific heat. These properties are exactly the same as 
predicted by the BCS theory for singlet pairing, which also has an isotropic energy gap. (In 
real metals, with anisotropic Fermi surfaces, the BCS gap can also be anisotropic, which is a 
complication we shall not discuss.) The BW theory predicts that the transition from the 
normal to the superfluid phase is a second-order phase transition and that the specific heat is 
discontinuous at this phase transition. These properties are observed in superfluid *He, as 
mentioned earlier. The interesting difference between singlet and triplet pairing is in magnetic 
phenomena. There are interesting nuclear magnetic resonance phenomena in the triplet states 
which were very important for identification of the superfluid phases and which verify the 
theory. These are described by Leggett (1975). 

The A phase has a different type of triplet pairing states than the B phase. The latter is 
well described by the theory of Balian and Werthamer (BW). The states of the A phase are 
also solutions to the gap equation (11.253) for triplet pairing. They are called equal spin 
pairing (ESP). They are composed of only the m, = +1 components of the spin angular 
momentum and not with the m, = 0 components. Now consider the theory when A, = 0 
while retaining the two gaps A,,. 

The first realization is that there is no relationship between the orbital momentum of the 
two gap functions A, and A_,. The equation which is solved for each A, involves the gap 
function Aj). If the latter is zero, there is no connection between m = +1. The excitation 


energy E,,(p) = ,/ E + |A,,|? involves only one gap function. Another way to understand the 
decoupling of A, and A_, is to start from the beginning and write the equations of motion. In 
describing the response of a spin-up particle, the only nozero terms are with the correlation of 
the motion of other spin-up particles. There is no correlation with spin-down particles once 
Ay = 0. The spin-up particles live in one world, and the spin-down particles live in another. 
The orbital motion of spin-up and spin-down particles is arbitrary. There are two independent 
superfluids, one of spin-up and one of spin-down, which coexist in the liquid. 

The orbital motions of each superfluid need not be correlated. A consequence of the 
orbital independence is that the excitation energies E,,(p) and gap functions |A,,(p)|* no 
longer need to have the same values for m = 1 and m = —1. For example, a possible set of 
solutions are 


Ap) = 3G, + iPAQ) = /3sin@e*A@) 


(11.271) 
A_1(p) = 36, + PAP) = /3[sin(®) co5() + icos(®)]A(P) 
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It is obvious that E (0, $) 4 E_,(0, o) except at special angular points. The excitation and 
energy gaps for spin-up and spin-down particles would be quite different. Each energy gap 
function has the same shape relative to its axis (north and south poles), but these poles are 
rotated relative to each other for spin-up and spin-down particles. 

The superfluid is called unitary if the spin-up and spin-down particles have the identical 
angular dependence of their energy gaps E,(0, 6) = E_,(8, }). ESP states are unitary when 
the north-south poles are aligned for the two systems. The superfluid is called nonunitary 
when the excitation energies are not identical as a function of angle. The BW state is unitary. 

The A phase appears to be a unitary ESP state. The north and south poles are aligned for 
the two gap functions. This is an experimental conclusion, and there is no particular reason 
the state has to be unitary. The important question is the nature of the forces which tend to 
align the poles. There are no forces in the Hamiltonian (11.236) which cause alignment. The 
Hamiltonian neglects the very, very weak forces due to the dipole—dipole interactions between 
the nuclear moments. Leggett has shown that these dipolar forces will tend to align the poles. 
Although it is a weak effect, there is nothing to oppose it. Of course, any external perturbation 
such as a magnetic field will also tend to align the poles, since alignment lowers the magnetic 
energy. Nonunitary states are discussed in reviews by Leggett (1975) and Anderson and 
Brinkman (1975). 


11.3. QUANTUM HALL EFFECTS 


The quantum Hall effect (QHE), and the fractional quantum Hall effect (FQHE), are 
phenomena which occur in highly correlated electron fluids. The QHE was discovered by von 
Klitzing et al. (1980), and the FQHE was discovered by Tsui et al. (1982). 

Electrons are allowed to move in only two dimensions using semiconductor technology 
to confine the motion in the third direction (z). The application of a strong magnetic field B 
along the z direction causes the electrons to move in periodic orbits in the xy plane, which are 
quantized. This quantization leads to energy levels E, = ho,(n + +), where the cyclotron 
frequency is œ, = eB/m*c, and m* is the effective mass of the electron in the semiconductor. 
The center of the cyclotron orbit can be anywhere in the plane. The electrons select the 
centers of their orbits to be arrangements which are highly correlated. At certain densities the 
electron gas becomes quite rigid, or incompressible, which is the simple explanation of the 
QHE and FQHE. 

The topic shares many attributes with superfluid helium. Perturbation theory does not 
work. The best wave functions are guessed, and are written down by intuition. The excitation 
spectrum is calculated using Bijl-Feynman theory. 


11.3.1. Landau Levels 


In a large magnetic field, electrons tend to have orbits which oscillate in time with an 
angular frequency given by the cyclotron frequency œ, = eB/mc where B is the magnetic 
field. If the relaxation time for scattering is t, then the behavior of electrons depends upon 
whether œ,t is greater or smaller than one. If œt >> 1 then the electrons form very periodic 
orbits. Their energy levels are quantized in units of h@,, which are called Landau levels. 
These orbits are simple to describe in two dimensions. 
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In the theory of electricity and magnetism, the potentials may be either scalar potentials 
o(r) or vector potentials A(r). The magnetic field is introduced through a vector potential. 
The basic equation is B = V x A. When the magnetic field is a constant there are many 
different ways to choose a vector potential to satisfy this relation. If the magnetic field is in the 
z direction, then the following choices all give the correct magnetic field 


A = B(-y, 0, 0) (11.272) 
A = B(0, x, 0) (11.273) 
A= =(-y, x, 0) (11.274) 


Each different vector potential is a choice of gauge condition. Since the eigenvalues do not 
depend upon this choice of A, they are called gauge invariant. The first gauge is chosen, for 
no particular reason, so that the Hamiltonian is 


| e 2 
H=-Y—Ip.—--_-A )| — B 11.275 
2 om, |p; c (r;) 2 Py l ) 
1 ey,B\* 
oap (eyw a 
j j 


The electron has a spin, and a magnetic moment w;. The last term in the interaction is the 
Pauli interaction between the electron moment and the magnetic field. For electrons in free 
space, the Pauli term and the orbital eigenvalues have similar energies. The QHE and FQHE 
measurements are usually done on semiconductors with very small effective mass m*, such as 
GaAs with m*/m = 0.07. The effective mass should be used in calculating the cyclotron 
frequency @, = eB/m*c. The symbol m in our equations is actually m*. The level splitting 
hw, is greatly enhanced in the semiconductor because of the small effective mass. The Pauli 
energy is altered much less by the band structure, so that it is much smaller than ho,. 
Although small, it is important for discussing the experiments. It is ignored for now, in order 
to concentrate on the Landau term. 

The Hamiltonian does not yet contain interactions between particles. In this case, the 
solution is found for one electron, and the many-particle ground state is composed of elec- 
trons which occupy these one-electron states. Consider the Hamiltonian for one electron, 


nas | (p42?) 4 (11.277) 
~ m| Pt Py l 


The potential energy term in (11.277) contains y as the only space variable. The Hamiltonian 


commutes with p, so the eigenstates can be chosen to be plane waves in this coordinate. Write 
the solution as 


elk 


Wer, y) = = FO) (11.278) 


e! 


ik,x 2 
HwW(x, y) = in (i + 5) salo (11.279) 


C 
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The effective Hamiltonian operating on the y variable is a harmonic oscillator. This obser- 
vation is made clearer by defining some constants, where the electron charge is (e<0) 


__ eB 
o= (11.280) 
h he 
P = = — . 
no, = Tel (11.281) 
Ack 
=—-—*Ž=kPÉ 11.282 
may- | mm yy LEO) (11.283) 
Y=" J 2m 2 Yo? Y i 


The last equation provides the effective Hamiltonian in the y direction in the explicit form of 
the harmonic oscillator equation. Its eigenstates are harmonic oscillator functions ,(y — yo) 
with eigenvalue E, = fiw,(n +4). The solution to the Hamiltonian is 


ik,.x 
W(x, y) = TO — yo) (11.284) 
E, m, = ho,[n +4] +m,uB (11.285) 


The magnetic length scale is /. The last term is for the Pauli interaction, where m, = + L The 
Landau levels are states with different values of n. The lowest Landau level has n = 0. It is 
interesting that the eigenvalue does not depend upon the quantum number k,. The state has a 
high degeneracy, in that many different values of k, have the same eigenvalue. The classical 
picture has the electron going in a circular orbit. The center of the orbit can be located any 
where in the system, without altering the energy. For the quantum system, the value of k, 
relates to the location of the center of the harmonic oscillator. 

A system of one electron will have a ground state with the electron in the lowest Landau 
level (n = 0). The interesting question is how these states are occupied for a set of N, 
electrons. The question can be rephrased: In a system of area A = L,L,, how many electrons 
can be put into the lowest Landau level? The constant yọ is the center of the cyclotron orbit. 
The orbit center must be in the area, which provides the constraint 


0 <y = kÊ < L, (11.286) 
2nn, L 
k, = A <3 (11.287) 
LL, mo 
XY — 
n S > aA = Anax (11.288) 


Although the energy does not depend upon the value of k,, this quantum number has a 
restricted range of value. The requirement of keeping the orbit center in the area constrains 


ny < Nmax: 


Define p(E) as the density of states per unit area. This state density is 


E 8 E, m,) (11.289) 


p(£) = 
LL, n,ny,,Ms 
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where En, m, is the eigenvalue in (11.285). The summation over n, just gives nmax- Then the 
factor of area cancels 
O 
p(E) === E E — Ep) (11.290) 
mO, 
ny = | dEDE ne (E) = 2 D nEn) (11.291) 
mo 1 e B 
— = — = —B = — 11.2 
h 2nl* he Do (11.292) 
v 
v= J np(Enm,) (11.294) 


The density of electrons nọ (#/meter*) can be expressed as a prefactor times the number of 
occupied Landau levels, where the Fermi—Dirac occupation number is np (E). Note that the 
prefactor can be written as m@,/h = B/Q where the quantum of flux is pọ = hc/e. The 
factor of v is called the filling factor and determines the number of occupied Landau levels. 

There are two length scales in the problem. One of them is the magnetic length /. It 
determines the radius of the harmonic oscillator orbits. The other is the average separation 
between electrons, which is called a. Highly correlated liquids usually have short range order 
in the form of a triangular lattice with each electron having six neighbors at a distance a. As 
an example, examine a woodpile stacked with equally sized logs. The area per electron in the 
plane triangular lattice is a?./3/2. Setting this formula equal to the inverse of the electron 
density gives that a? = 2/n)/3. At what value of the magnetic field does the separation 
between electrons a become larger than the orbital diameter? The harmonic oscillator wave 
function for the lowest Landau level is exp[—(y — yo)” /2]. The exponent is —1 at Ay = J/21. 
Taking this to be the radius of the orbit, the separation between electrons is greater than the 
orbit diameter when 


a>2V21 (11.295) 
2 = a’ >8Ê _ 4% (11.296) 
NV 3 nB 
T noo 
—> L22 = n (E) =v 11.297 


The number on the left is about 0.907. The summation over the occupied levels, on the right 
in the above equation, must be a less than 0.9 before the Landau levels are truly separated in 
l ap 2 


space. These values are reached in the FQHE for fractions such as v = ; or $. 


11.3.2. Classical Hall Effect 


The classical Hall effect must be understood before its quantum version can be appre- 
ciated. The measurement geometry is shown in Fig. 11.16. The shaded area is the experi- 
mental sample. A current J is sent through the material, along the direction labeled “x”. The 
voltage change V, is measured between the two leads on top. This configuration is a standard 
four-probe measurement. The voltage contacts are not put at the end of the sample, in order to 
avoid the influence of nonohmic contacts where the current enters and leaves. Another pair of 
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FIGURE 11.16 Experimental geometry for the measurement of the Hall effect. The magnetic field is perpendicular 
to the plane. 


voltage contacts measure the voltage V, across the sample. This voltage is zero in the absence 
of a magnetic field. 

The Hall effect occurs when there is a magnetic field B perpendicular to the plane of the 
sample. Then a nonzero value is found for the voltage V,, which is proportional to both B and 
I. Since the current J is along the x direction, define two components of resistance 


(11.298) 


(11.299) 


For most materials, the magnetic field hardly changes R,,,. The main experimental observation 
is the nonzero value of Ry- 

The physics is simple when considering the motion of the electrons as classical. Assume 
the particles obey Newtonian mechanics. In two dimensions, the magnetic field creates a 


Lorentz force 


mv = e[E + LvxB, -5 (11.300) 
v,B 
mi, = e|E. + 2 = (11.301) 
C T 
. B mv 
mi, = eE, — =] -— (11.302) 


where E is the electric field and t is the relaxation time for scattering. Consider that the 
experiment is steady state, so that the time derivatives are zero. The currents densities are given 
by J, = engv,, J, = engv,. There is no current in the y direction since the devices to measure 
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voltage have very high impedance. Setting v, = 0 in the last equation gives the identity 


E, = — = => 11. 
” c ence L, (11.303) 
L 
V, = JB (11.304) 
0 
eT Nye?t 
v=2E, J.=0F, o= (11.305) 
m m 


The Lorentz force tends to drive the electrons in the perpendicular direction. The flow of 
electrons along the x direction is diverted to the y direction. Since no current can flow in the y 
direction, then a voltage V, must be be built up to force the electrons back along the x 
direction. The voltage is due to an accumulation of excess charge along the top of the device. 
The quantity J, is the current density. In a three-dimenensional system it has the units of 
amperes per area, so that J. = JA) where Ag is the cross sectional area of the sample. In the 
present case Ay = L,t, where t is the thickness of the sample. One then gets in 3D that 
V, = ai (11.306) 
gect 


The Hall effect is very useful in the laboratory for measuring the density nj of current 
carrying particles in the system. It is a relatively easy measurement if a sample of material is 
available which is several millimeters on each side. The Hall measurement gives the density 
of carriers in semiconductors and metals. The result is only simple to interpret if all carriers 
are in the same energy band of the crystal. 

In two dimensions the Hall effect is even easier to derive. The particle density np has the 
units of number of particles per area. The current density J, has the units of amperes per unit 
width. Therefore J, = JL, and the expression for the Hall voltage in 2D is 


LB 
V = = 


y= (11.307) 
0 
B 
x = — (11.308) 
ngec 


This equation does not depend upon any dimension of the sample. Samples of differing 
shapes and size give the same measurement. 
The classical Hall effect has two characteristics: 


e The longitudinal resistance R, is independent of magnetic field. 
e The transverse resistance R,, is proportional to magnetic field. 


These characteristics change in the Quantum Hall Effect. 


11.3.3. Quantum Hall Effect 


A quasi-two-dimensional electron gas can be created in semiconductor microstructures. 
There are several ways to accomplish this feat, and most of them have shown the QHE. One 
system is the channel of a field-effect transistor. The gate voltage is used to confine the 
electrons in the direction perpendicular to the plane, where the barrier oxide traps them along 
the interface. The electrons are free to move in the plane, and their motion is largely two 
dimensional. In order to observe the QHE (quantum Hall effect), the semiconductor sample 
must have exceptionally high purity and a large value for the mobility. Experiments originally 
showed the QHE in silicon MOSFETs. Another kind of semiconductor microstructure is a 
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FIGURE 11.17 Quantum Hall effect and fractional quantum Hall Effect in GaAs. (Stérmer, 1999 used with 
permission). 


quantum well. It is formed by growing n-layers of one semiconductor, such as GaAs, between 
another semiconductor such as AlAs. Since AlAs has a larger energy gap, electrons in the 
conduction band of GaAs are confined in the direction perpendicular to the layers. They move 
readily along the layer, so the motion is quasi-two-dimensional. Since GaAs and AlAs have 
the same lattice constant, they can be alloyed continuously. The barrier layers are often 
Al,_,Ga,As. Most measurements have been done in single quantum wells of GaAs, although 
the phenomena has been observed in a few other semiconductors. 

A typical result for the QHE and FQHE is shown in Fig. 11.17, which shows the 


resistivity as a function of the magnetic field B. The two resistances have the following 
features: 


e The transvere resistance R,, = Ry develops plateaux at values of Ry = Ro/i, where 
Ry = h/e? © 25812.8Q, and i is an integer or simple fraction. The QHE is when i is 
integer, the FQHE is when i is a fraction. 

e The longitudinal resistance R,,, = R —> 0 during these plateaux. 


The quantity Rp is a combination of fundamental constants. It has become the world standard 
for maintaining electrical resistance. Since the resistance steps appear at the same values, 
regardless of the shape of the sample, any laboratory in the world will find the steps at the 
same numerical values. 

In the earlier discussion of Landau levels, an expression was derived for the density of 
electrons in the plane. Using that expression (11.291) gives 


1 mec æ 
z 2i (11.309) 
v= >, Nr[BE nm, — L)] (11.310) 
R 
Ry =— (11.311) 
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The QHE has a simple explanation if the function v has integer values. The FQHE requires 
that the filling factor v be a simple fraction. 

The key to understanding the QHE is to understand the behavior of the function v as the 
magnetic field in increased. The important parameter is the chemical potential u. How does it 
change with magnetic field? There are three different models for the behavior of the chemical 
potential in these systems. 


11.3.3.1. Fixed Density 


The first model assumes that the density of electrons n is fixed in the sample. When the 
magnetic field B is varied, the chemical potential must also be varied so that the density nọ is 
unchanged. When considering a function such as (11.291), in order to keep the nght-hand 
side of this equation a constant as B is varied, the chemical potential must go through some 
gyrations. The product mjd) = By has the units of magnetic field. For small values of 
B, (B & Bo), the summation over n needs many terms. Then the chemical potential varies 
with magnetic field in a manner which is periodic in 1/B. This behavior is well known, and 
shows up experimentally in de Haas—van Alphen, Shubnikov—deHaas, and other measure- 
ments of these oscillations. However, as B increases and becomes B > Bo then all electrons 
are in the lowest Landau level. The requirement of constant density gives the expression for 
the chemical potential (neglecting the Zeeman term) 


B 


Bo © Tita. 2 4 1 (11.312) 
B 
jm = 5ho, ks in( = 1) (11.313) 


The variation of the chemical potential with magnetic field is an important feature of electrons 
in large magnetic fields. 

This model is applied to most three-dimensional solids. There the number of electrons is 
fixed by considerations of charge neutrality. 


11.3.3.2. Fixed Chemical Potential 


The second model for the chemical potential assumes that it does not change with 
magnetic field. Note that in the experimental measurements, there are at least five metallic 
wire leads soldered to the sample. These metal contacts form a reservoir of electrons which 
could fix the chemical potential in the device. The two-dimensional device has only about 
10!! conduction electrons. That is a very small number compared to the number of electrons 
in the metallic reservoirs. The change in energy of these small number of electrons has a 
negligible contribution to the energy of the system. The chemical potential is thereby fixed. 
The assumption of a fixed chemical potential certainly applies to MOSFET (“Metal-Oxide- 
Semiconductor-Field Effect Transistor”) devices. The usefulness of the gate voltage is that 
one can use it to modify the density of electrons in the conducting channel. The electrons in 
the channel have a negligible effect on the chemical potential, which is determined by the 
source and drain. 

In the case that u is a constant, then the function f will have plateaux at low temperature 
when plotted as a function of magnetic field. 
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Figure 11.18 shows a plot of the series 


which is the number of electrons in Landau levels, of a single spin component, as a function 
of chemical potential. When ha, >> kgT there are plateaux in the occupation number f. 
Figure 11.18(b) shows the value of nọ as the magnetic field is varied. Note that nọ — 0 as the 
lowest Landau level exceeds the chemical potential (ha, > pu). This prediction is in direct 
contradiction to the experimental data, which shows that nọ is relatively constant at high 
magnetic field. Note that the resistance R, tends linearly upward in Fig. 11.17, except for the 
plateaux. The linear behavior is only possible when np is relatively constant. 

The model with constant chemical potential explains well the QHE (Yennie, 1987). The 
experimental plateaux are very well fit by this model. The plateaux appear exactly as in Fig. 
11.18, which plots f, while the experiments measure 1 /f. However, this model fails to explain 
the FQHE, and fails to explain the data at large magnetic field which shows that np is rela- 
tively constant. 


11.3.3.3. Impurity Dominated 


The third model for the chemical potential invokes localization caused by static impu- 
rities (Prange, 1987). All semiconductors have impurities. They tend to bind the Landau 
levels near potential mimima in the plane. These minima are due to impurities in the 
underlying substrate. In one dimension, all states are localized by any impurity scattering. In 
the QHE, the system is two dimension. However, one quantum number becomes localized 
into a harmonic oscillator state. The second quantum number (k,) defines a type of one- 
dimensional motion. The particles behave, in some respects, as if they were in one dimension. 
However, their bound states are not given by a Hamiltonian in one dimension, since there is 
no kinetic energy term associated with the motion in the x direction. 

The argument in favor of localization caused by impurities, as controlling the chemical 
potential, is that as the experimental samples are made with higher purity, the plateaux 
become narrower. Inpurities are essential for pinning the value of chemical potential, and 
thereby observing the FQHE. 

The fractional quantum Hall effect (FQHE) is the appearance of additional plateaux at 
noninteger value of the function v. Tsui et al. (1982) discovered that increasing the magnetic 
field, beyond the plateaux for v = 1, produced additional plateaux which appear at fraction 
occupation such as i or 2, The experimental technique continues to improve, with samples of 
higher purity, at larger magnetic fields, and at lower temperature. Figure 11.17 shows one of 
their recent experimental result on a sample of very high purity. Numerous fractional plateaux 
are apparent. 

The plateaux at fractional filling are due to electron correlation. The theory of this 
phenomena shares some features with liquid helium. It is impossible to describe the correlated 
states with any kind of perturbation theory. The theory has no small parameter on which to 
base a perturbation expansion. Instead, progress has occurred by writing down ad hoc wave 
functions, and seeing what they give. The best wave function is described in the next section. 
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FIGURE 11.18 (a) Number of electrons in Landau levels as a function of magnetic field for a fixed chemical 
potential. (b) The density of electrons vs. magnetic field, in units of 10!° m~?. Parameters are T=0.1 K, m* = 0.07, 
no = 0.47 x 10? m7, and p = 30 pEp,. 
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11.3.4. Laughlin Wave Function 


Laughlin proposed a wave function for the FQHE which has become the standard basis 
for all theories. It is discussed in this section. As with all wave functions in this chapter, it is 
written down by intuition, and contains correlation. The correlated wave function is not 
derived by any form of perturbation theory. It is just written down, and its properties are 
studied. 

The first step in the derivation is to solve the Hamiltonian for the symmetric gauge 
(11.274). For a single electron, the Hamiltonian in this gauge is 


1 BY 1 B 7 
H= pt Z| tl -Zy (11.315) 
2m 


The space variables are put in dimensionless form using x’ = x/l, y = y/l where the magnetic 
length is Ż = hc/eB. After some algebra, the Hamiltonian is rewritten as (dropping the 


primes) 
ho ð y ? a xV 
—~ ie} fo 47 —-—L 11.316 
H 2 (E) + (a 5) | N ) 
ho > l; 
= —*=|— —p* — 11.3 
5 V +7P L ( 17) 
0 0 0 
— —il ,— —y—) = -j— 11.318 
É (z 5) E: (11.318) 


where tan(þġ) = y/x and p? =x? +y? in polar coordinates. Consider the eigenfunction 


Vin(P, ) 


Vn(P, >) = Npp” et ee (11.319) 
Hy, = a a (11.320) 
i 
N, =———— 11.321 

~y 2t%m!2™ ( ) 

oO 21 
Òm = l pap | LOW Vm! (11.322) 

0 0 


The eigenfunctions Ų,„ have an eigenvalue of Aw,/2 which indicates that the electron is in the 
lowest Landau level (n = 0). The quantum number m indicates angular momentum, since 
LW, = my. The value of angular momentum m does not alter the eigenvalue. In the prior 
section the silent quantum number was k,, which also did not affect the eigenvalue. In the 
symmetric gauge the silent quantum number is m. The last expression shows that these 
functions are mutually orthogonal, because the integral over dd vanishes unless m = mw. 

In the FQHE, the high value of magnetic field at low temperature ensures that 
ho. >> kT. All electrons are in the lowest Landau level for the case that the fractional filling 
v<l. Then the eigenfunction ,, is the basis for all electrons in the ground state. An 
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interesting question is to ask where are the electrons? Use Stirling’s approximation to the 
factorial to write the electron density for large m as 


Pm(P) = Wl” = =a, pre? (11.323) 
N KS exp] — 4 +mln(p)+m-— m In(2m) (11.324) 
~ iep - 2m) +m nS) (11.325) 
N G — 2m) + min( 1 + p? z”) 


N l l-3, 2 
X T] S7 (p — 2m) | (11.326) 
For large value of angular momentum m, the peak in the electron density occurs at p ~ /2m. 
The electrons are in a circle, where m = 0 is the orbit at the center of the circle, and higher 
values of m are further from the center. 

The above discussion is for a single electron. The next step is to construct a many- 
electron wave function. In this case it is useful to describe the position vector (x, y) by the 
complex number z = x + iy = pe’®. The basis set is now written as 


Vn) = Nz” exp(—lz|?/4) (11.327) 


In the Hartree-Fock approximation, the wave function for a many-electron system is a Slater 
determinant. All electrons are assumed to have the same spin direction since they are aligned 
by the magnetic field. If they have spin up, the eigenfunction is «, for the j-th electron. Each 
electron must be in a different state, so that they occupy the angular momentum states 
m=0,1,2,...,M—1. Then the Hartree-Fock wave function for N electrons is 


WolZ1) WED WVE) = Yy) 
Yoz) YVE) WE) = Waie) 
WP y(Z1,205---,2Zy) = = | Yo (z) Wy(Z3) ye) = Yy) 


al- 


Wolzy) VEn) WolZy) = Vy_-1@w) 


X Oly ++ Oy (11.328) 
1 1M, 
= yp NP +> ‘AN oo] 35 iz; | (11.329) 
Lozo zgo o ZN 
loz z < Nol 

Dy=|1 3 3 ooe BO (11.330) 
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The determinant is due to Vandermonde, and is exactly equal to the product of the difference 
for all pairs of coordinates 


1 LN 
Py (21, Z2- Zy) = Jy Mk Iy: — z) ++ ay a| -32 a (11.331) 
. J= 


The first product is the normalization coefficients. The second product contains (z; — z;) for 
all pairs of electrons. This factor ensures the many-electron wave function is anti-symmetric 
under exchange of any two electrons, as is required for fermions. 

The wave function Yy can be used to calculate some properties of the many-electron 
system. Squaring it, and then integrating over all but one coordinate, gives the average density 
of one electron. Using the theorems in Sec. 5.1.9, and the fact that the eigenfunctions are 


orthogonal, gives the result 
eP 2/2 p?” 


2nN 2 2mm! 


piP) => SE Val? = (11.332) 
Again the notation p, means the density of one particle in the many-particle system. For small 
values of p, the summation gives exp(p?/2) which cancels the other exponent, and 
p, ~ 1/(2nN). The density of N such electrons, after restoring the factors of length /, are 


Ny = Np, © (11.333) 


1 
2rl? 
Equation (11.293) has almost the same expression except the right-hand side is multiplied by 
v. Equating these two expressions gives that v = 1. The many-particle wave function was 
chosen to fill up the lowest states of angular momentum. This case corresponds to the 
maximum packing of electrons in the lowest Landau level, which is a filling factor v = 1. 

The pair distribution function g(r) is found from p,(r,, r2), which is obtained by inte- 
grating over all but two coordinates. Since the functions are orthogonal, p, is found from all 
pairs of — wave functions 


Pa(ri, r2) = WOT NOV y Melee) — Wilr yr)? (11.334) 
conn, N-1 1] 7 
= SGN oI) > DA (Po + pi/p#! — 2(01p2)"" cosl(i — /N(, — ba) 


(11.335) 


For small values of p; > the series converge to an exponential. In this case the pair distribution 
function is 


1 
P(r], r2) = nN(N — 1) [1 — exp(—+[pi + p3 — 2p; Pz cos(ọ; — b2)])] 
i (11.336) 
g(r 12) = 1- ep -zg =? | 
S(k) = 1 + no [Pno — 1ļ]e™" = 1 — exp[—k° f /2] (11.337) 


Note this pair distribution function is for two particles in a system of N > 1 particles. A 
different result is obtained for only a few particles. One can also derive the three- and four- 
particle distribution functions (Jancovici, 1981). The static structure factor S(k) is also given. 
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Note that it goes to zero as O(k*) rather than as O(k) as in a fluid of neutral particles. These 
formulas apply only for the case of v = 1. 

In Chap. 5 the exchange energy per electron of the electron gas is provided by the 
formula 


E, = -2e h — g(r)] (11.338) 


where it has been adopted to two dimensions by changing dîr —> d?r. Using the above 
expression for g(r), and nọ = 1/(2n/*) for v = 1, the exchange energy is (r = p//) 


e °° r2 e? TU 
E = — -7 fa — , 
x J | dre ane (11.339) 


This formula is valid for filling factor v = 1. For fractional filling, the exchange energy in the 
Hartre-Fock approximation is written as E, = —C,e’/I. Using the Laughlin wave function, 
described below, gives the following values for the dimensionless constant: 


1/v C, 


1/1 0.627 
1/3 0.416 


1/5 0.334 


The Laughlin wave function for the FQHE was written down by intuition starting from 
Eq. (11.331). It is 


N 
Y(z,, i Zy) = Np U5, i(Z; — z Oy s.. AN ap] - roe st | (11.340) 


where the normalization constant is N,,. The factor of (z; — z;} is the difference between the 
coordinates raised to a power of p. The fermion wave function has to change sign under the 
exchange of any two coordinates. This fundamental antisymmetrization is achieved as long as 
p is any odd integer. The case with p = 1 is the integer QHE with v = 1 in (11.331). Other 
values such as p = 3,5, 7--- provide an accurate wave function for the FQHE. The Laughlin 
wave function cannot be derived from a single Slater determinent and is not a Hartree—Fock 
wave function. It contains correlation and is a highly correlated wave function. 

The exponent p is related to the filling factor p = 1/v. The choice p = 3 gives a wave 
function for the fractional state v = L, The proof of this assertion starts by examining the 
density matrix 


PulTi, fz,- ry) = ECZ, Z, Zy)? (11.341) 


= Noy M; |r; — r|? a| 30 z] (11.342) 


ovg 
p 


= Mue p 2 ln lr; — rj" — E7 (11.343) 
i>j j 


The spin eigenstates have been eliminated. Also note that 


|Z, — z|? = (xı — x) + ( -y = (r; — r2)”. (11.344) 
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The density matrix can be cast into the form for a classical fluid p, ~ exp[—BV] where the 
potential energy BV = U has the form 


U(r,,...,¥y) = D -pð lnj|r;— (11.345) 


i>j 


The potential has two types of terms. Each electron has a potential 77/2 which tends to 
confine it to the origin. There is also a particle—particle interaction of the form of a logarithm. 
It is characteristic of the two-dimensional Coulomb gas, where the particles are not points 
with V = e? /rj, but instead are charged rods perpendicular to the plane which have 
V =e In(r;). The same potential is found for interacting vortices. 

For a system of N electrons, the two-dimensional fluid forms a circle of radius 
R ~ J2N. The electrons near to the center of the circle are in equilibrium, which means that 
the force on them should be zero. For a uniform circle, the force can only be radial. The radial 
force on one of them, say r4 is 


Fin) =-= -r +P D> ri 27) coslo) (11.346) 


jari +17 — 2r,r; cos >; 


where , is the angle between r; and r;. The last term is evaluated by changing the 
summation to an integral, assuming the particles have a uniform density of no 


>, 2r — 2r, cos(o;) 
F,r) = -r + pn | dr HAFA Inr,c0s6, (11.347) 
RA 
— p Ee | 72, ee 11.348 
1 H ry "f is R E sg l ) 
The angular integral is done using 
2n 
= dọ 21 
=== O(A? — B’ 11.349 
| A+Bcosh / A2 — B2 ( ) ( ) 
on Rr 
F (ri) = -r + rm] rar) + a 5 | (11.350) 
ri |r ry -r | 


The bracket in the integrand is zero ifr; >r, and is two ifr; <r,. In fact, the above expression 
is Gauss’s law for the two-dimensional Coulomb gas. The force comes only from the charge 
inside of the circle of radius r,. The remaining integral is elementary, 


0 


The density nọ = v/2n in units where all dimensions are scaled by the magnetic length /. 
Then the factor in brackets is [1 — pv]. The requirement that the force on particle 1 be zero 
sets p = 1/v. Since p is an odd integer, then the Laughlin wave function describes the 
fractional fillings of v = 1/p such as = L, i L etc. Numerous numerical tests have shown that the 
wave function describes these states extremely well. 

The wave function for p = 1 is the Slater determinant which gives (11.331). By filling up 
all states m = 0, 1,2,...,N — 1, the system has a uniform density given by nọ = 
1/(2n/7). The Laughlin wave function for v = 1/p also has a uniform density, on the average. 
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It is interesting to calculate the current density associated with the Laughlin ground state. 
First calculate the current from a typical electron, such as z,, using the standard quantum 
mechanical expression 


en x . 
J, = api FV P — P*V Y} (11.352) 
vY=Ħ|-* ‘lip Baws (11.353) 
2 i=2 Z1 
h — y) — Wx, —x. 
J, — _ pen p]? SŽ (vı yi) Yer x;) (11.354) 
m i= (rı — r;) 


The summation over i can be changed to a continuous integral $`, > no f d?r. The factor of 
[F]? supplies an additional factor of 1/A, where A is the area. The current from particle 1 is 


Prt far — y;) — WX, — x;) 
mA (rı — r;) 


The integral is zero for an infinite system. The result is nonzero for a finite system consisting 
of a circle of radius R and area A = nR?. The angular integral have the same form as 
discussed in Eq. (11.347), so that the integral over dr; is constrained to values r; <r. The 
result is 


J, = (11.355) 


epnoh 
2mA 


J, = [xy — yx] (11.356) 
The current is rotational. This result should not be surprising, since the lowest Landau level is 
composed of eigenstates with many values of angular momentum, which are all rotating in the 
same direction. Also note that the current obeys the equation of continuity V- J = 0. 


11.3.5. Collective Excitations 


Several collective excitations have been identified for the QHE and the FQHE. Two are 
discussed here. 


11.3.5.1. Magnetorotons 


Girvin et al. (1986) used the Bijl-Feyman theory to calculate the frequency dependence 
of the density fluctuations in the QHE and the FHQE. The method is identical to that used 
earlier in this chapter for excitations in He. However, the derivation for the FQHE has some 
new wrinkles. These excitations are called magnetorotons. 

The excitation of wave vector k is obtained by multiplying the ground state wave 
function Y, by the density fluctuation p(k) which gives an excited state ‘Py 


p(k) = ne (11.357) 


=== p(k), (11.358) 


F= a 


o(k) = (¥(H — Eo) Fy) = wa ¥elP' (KH, pI.) 
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As in the derivation for *He, the only term which fails to commute with the density operator is 
the kinetic energy. Repeating that earlier derivation gives the same result 


Kk 
~ 2mS(k) 
This result is quite universal. It applies to any system, and to any fractional filling. 
First examine the case v = 1 where the lowest Landau level is completely filled. The 
structure factor for that case is given in (11.337). 
E 
2m 1 — e-KP/2 


At large values of k the excitations go to the free-particle kinetic energy w(k) = e,. At small 
values of k the result is 


olk) (11.359) 


o(k) = (11.360) 


, h 


The excitation energy is just the cyclotron energy iw,. Of course, this makes physical sense: 
if the lowest Landau level is filled, the excitation is to the next Landau level which is (n = 1). 
This result is also in agreement with a theorem of Kohn (1961) which states that the magneto- 
phonon excitation energy is always hw,, regardless of the filling factor, and regardless of the 
interactions. 

Girvin et al. (1986) remark on the accuracy of the Bijl-Feynman formula. Here it gives 
the exact excitation energy, while for *He it gave a very approximate excitation energy. They 
remark that the current from this excitation satisfies the equation of continuity (V-J = 0), 
while it does not when using the correlated basis function for the ground state of *He. They 
also remark that the theory assumes a Single Mode Approximation. All of the excitation 
spectra is in the one state. Most fermion systems have a collective mode (e.g., plasmons) plus 
electron—hole excitations. At long wavelength the collective mode dominates. Then the single 
mode approximation is an accurate description of the excitation. 

Girvin et al. predicted the existence of a new low energy collective excitation for the 
FQHE. The first guess is to use the Laughlin wave function for the ground state Y, for 
p = 3,5, etc. According to Kohn’s theorem, the calculated excitation energy will still be ha,. 
This large excitation energy is obtained because the density operator p(k) has contributions 
from many Landau levels. However, the low lying excitations from the Laughlin state are 
probably to states which are different configuration of the lowest Landau level. So they used a 
projection operator P,, which projects out the states for the Landau level n. In their case, they 
only wanted P). Both numerator and denominator are evaluated in this reduced Hilbert space 


_f®) 
ok) = (11.362) 
p(k) = Pop(k)Po (11.363) 
5E) = = (P IB OOE) (11.364) 
FO = È (PAPOLA, BODY, (11.365) 


The evaluation of this expression required numerical work such as Monte Carlo. They were 
able to prove some important results at long wavelength. They showed that for both the v = 4 
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and v = } states that w(0) 4 0 and the excitation spectrum has a gap. The constant result for 
k — 01s not due to plasmon effects. In two dimensions, the plasmon without a magnetic field 
goes as œ?(q) = nge*q/m and vanishes at long wavelength. The gap at long wavelength is due 
to the rigidity of the Laughlin state. For v = 4 both f (k) and S(k) are proportional to k* at long 
wavelength, so that their ratio goes as a constant. 

Figure 11.19 shows the numerical predictions for the low energy excitation. In Fig. 
11 19(a) are S(q), S(q) for the state v = i In (b) is shown the excitation energy w(k) for 
v= L, Ł There is a minimum at nonzero wave vector, which is similar to the roton in 
superfluid *He. The name magnetoroton has been applied to this low energy excitation at non- 
zero wave vector. At large wave vectors the neutral excitation is a quasiexciton consisting of a 
bound state of a quasielectron and quasihole. 


S(q) 


0.20 


0.05 


0.0 0.5 1.0 1.5 2.0 


ql 


FIGURE 11.19 (a) S(q) for states v = +. The top line is S(q) and the lower line is S(q). (b) a(k) for the states 


v = 4,4. From Girvin et al. (1986) (used with permission). 
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11.3.5.2. Quasiholes 


Laughlin (1983) considered the excitation created by adding a “quasihole” or 
“quasielectron”. The easiest to understand is the quasihole. It is a place in the fluid where 
other electrons cannot be located. Denote the Laughlin wave function (11.340) as ¥ = |p, N) 
for p = 1/v and N electrons. The wave function for the fluid with a quasihole at position Zp is 


Yv({Z;}; Zo) = W(Zo) |p, N) (11.366) 
W (zy) = T(z; — zo) (11.367) 


The factor of W (zo) ensures that no electron of coordinates z; will be found at the position Zp. 
For the case that p = 1, v = 1 then this orbital factor is equivalent to adding another electron 
to the system at position zọ. The ground state energy is that of a system with N + 1 electrons 
in the lowest Landau level. The cost in energy from adding the quasihole, in Hartree-Fock, is 
AE, = ho,/2 — J/n/8 e?/I. 

For fractional filling the properties of this excitation must be determined by numerical 
methods. The probability is found by taking the absolute square of the wave function. The 
electron density p;(r;, rọ) is found by integrating this probability over all but one space 
variable. This density must be uniform except near to the point rọ, where it has the form of a 
pair distribution g(r — rg) 


PO) ++ ry; ro) = [Py ({zi}: zo)I (11.368) 
| Pdr, ---dry 

g(r — ro) = N | Pdrdr;dr; - - -dry (11.369) 

q = e [a?r — g(r — ro)] (11.370) 


The quantity q measures the amount of electron charge which is excluded from the vicinity of 
the quasihole. It is found that q = ev. The quasihole has fractional charge. 

The evaluation of the above integrals again uses the analogy with interacting fluids. 
Write P as a Jastrow wave function: 


P = N(zọ)exp(—U) (11.371) 
U = -p © hlr, - r, += Di - È 1n |r; = rol’ (11.372) 
i>j 


where N (zo) is the normalization constant. The fluid analogy has N particles of “charge” ./p 
interacting among themselves with logarthmic forces. An impurity at rọ of charge 1/ Jp 
interacts with the other electrons. The resulting pair correlation function is solved using the 
theory of interacting fluid mixtures, where one component is very dilute. The energy needed 
to insert the quasihole into the fractional ground state is E, = —C, e”/1. Numerical work 
shows that C, = 0.026 for v = 1 3 For a typical magnetic field of B=5 T, this excitation energy 
in temperature units is about 6 K. Experiments must be performed at a much lower 
temperature than this value, in order that quasihole excitations be thermally quenched. A 
similar energy scale is found for the magnetorotons. 


Problems 761 


Consider the additional current due to multiplying the Laughlin wave function by a sum 
of phase factors 


® = exp [ 2, vin) |¥ (11.373) 
(5J) = no Tor) (11.374) 


The additional current is given by the gradient of the phase factor. If the quasihole is located 
at Z) = 0 then its factor has the above form 


W = II,z = exp È mire") | (11.375) 

id(r) = lnr; + i0 (11.376) 

(5J) = noñ 8 (11.377) 
mr 


The additional current from the quasihole is also rotational. Girvin (1987) points out that the 
quasihole has the form of a vortex flow. 

In many two-dimensional systems the low energy excitations are vortices. The statistical 
mechanics of the disordering process are dominated by the binding together of vortices of 
opposite rotational motion. This phenomena was first described by Kosterlitz and Thouless 
(1973). However, the vortices in the QHE and FQHE are different than those of Kosterlitz and 
Thouless, and their theory does not actually apply to the present system. These differences are 
elucidated by Girvin (1987). 

The Laughlin wave function explains quite well the states with filling v = 1/p where p is 
an odd integer. The experimentals show many other fractions, and these are not explained by 
the Laughlin wave function. There are various schemes to explain these states, such as one 
proposed by Jain (1990). Another interesting feature is that the higher Landau levels show 
striped phases at very high magnetic field (Lilly et al., 1999). 


PROBLEMS 


1. Derive the formula for the Bose-Einstein transition temperature T, of a gas of bosons in two 
dimensions. 


2. In the Bogoliubov theory of superfluid tHe, derive the terms in the interaction potential which 
contain three C operators whose wave vectors are not zero. Identify the physics behind each different 
kind of term. 


3. Use CBFs to derive the McMillan result that the average energy per particle is 


2 
= = Bereta] re + va) (11.378) 
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4. Use the result (11.91) for S(Q, œ) to evaluate the moments (T = 0) 


| - © sen(o)S(O, o) = 2S(Q) (11.379) 
œ do 

i i 2. o) =? (11.380) 

i 20'S, ©) =? (11.381) 


5. Consider the case where the liquid helium is flowing down the pipe with velocity v,, and the pipe 
walls are moving with velocity v,. Derive an expression for the total momentum as a function of 


Pss PN> Ugs Uy- 


6. For the superfluid *He flowing through a pipe, derive an expression for the total energy to order 
O(v). Show it is not exactly given by the superfluid density p,. 


7. Use the paramagnon Hamiltonian (11.208) to evaluate the self-energy of quasiparticles by summing 
the set of diagrams in (a) Fig. 11.20(a) and (b) Fig. 11.20(b). 


08 8 


FIGURE 11.20 


8. Use the paramagnon theory to study the effective interaction between quasiparticles. By summing 
the diagrams in Fig. 11.21, show that the effective interaction can be written as 


I l o'o 
Vett =5 (= ZIPO TIPO i) (11.382) 
FIGURE 11.21 


9. Calculate the quasiparticle self-energy in normal *He arising from the diagram in Fig. 11.22, and 
use the result to find the quasiparticle lifetime. Show that the result is (11.221) but without the factor 
1 — ny. 
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FIGURE 11.22 


10. Use the result of Problem 7 [or Eq. (11.221)] to find the numerical value of the mean-free-path of a 
dressed quasiparticle in He at €, = 0 at T = 1 mK. 


11. Use Eqs. (11.209) to calculate the thermal conductivity and verify the result kgT = 33 erg/cm-s in 
Table 11.4. 


12. Find the zero-temperature gap function for the solution (11.266) to the gap equation for triplet 
pairing. 


13. Show by explicit calculation that the choice (11.269) for the BW orbital alignment has the same 
gap function A(p) as the choice (11.256). 


14. Calculate the pair distribution function g(r, — r,) in a system of only three particles using Eq. 
(11.331) for the case v = 1. 


15. Use Eq. (11.331) for N > 1 to calculate the three-body distribution function g(r,, r», r3) for the 
case that v = 1. 


16. Use the Hartree-Fock wave function for the QHE with v = 1 to calculate the ODLRO R(p, p^). 
17. In the QHE for v = 1, evaluate the partition function to derive the thermodynamic potential Qg. 


18. Show that the Laughlin wavefunction can be written exactly as 


Y = MI, 6"? | Ocu (x a) (11.383) 

l 
Dcy(Z) = exp| — ay zr | (11.384) 
u(z) = —p ln(z) + i |z|? (11.385) 


If there is no center of mass motion ($` z; = 0) then the wave function has exactly the Jastrow 
form. Use this form to show the denśity is uniform. 
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